To all whom it may concern:

Be it known that we, Ulrich Laemmli and Samuel Janssen,

have invented certain new and useful improvements in 

LINKED SEQUENCE-SPECIFIC DNA-BINDING MOLECULES 

of which the following is a full, clear and exact description. 
Linked sequence-specific DNA-binding molecules




The present invention relates to tandemly linked,
sequence-specific DNA-binding molecules with high affinity,
specificity and binding-site size. The invention also relates to
the in vivo, in vitro and ex vivo use of the tandemly linked
binding molecules for binding DNA in a sequence-specific manner,
and for regulating chromosome and gene function. The invention
also concerns the sequence-specific marking of DNA and
chromosomes using marked, tandemly linked binding molecules.

Small synthetic molecules that can target predetermined DNA 
sequences with high affinity and specificity could represent a 
major breakthrough in molecular biology. Binding of these 
molecules could serve to locally interact with proteins as well 
as to deliver a conjugated chemical group such as a fluorescent
label, a toxin, or a peptide. 

Recently, considerable progress has been made in the
synthesis of small molecules composed of heterocyclic organic
molecules, for example aromatic amino acids such as
N-methylpyrrole (Py) and N-methylimidazole (Im). These molecules
can bind specific DNA sequences with remarkable affinities
(Geierstanger et al., 1994). The pseudo-peptides (polyamides),
based on the structure of the naturally occurring antibiotic
distamycin, bind DNA in the minor groove as antiparallel dimers
(Pelton and Wemmer, 1989).

The sequence-specificity of these compounds depends on the
side-by-side pairing of this dimer, where, for example, an Im
opposite a Py (Im/Py) targets a G-C base pair, a Py/Im
recognizes a C-G base pair and a Py/Py pair (or Py alone) is
degenerate for both A-T and T-A base pairs (White et al., 1997).
Compounds composed of N-methylpyrrole, N-methylimidazole,
methyl-3-hydroxypyrrole and certain aliphatic amino acids can
therefore be designed in such a way that the position of these
units in the mostly linear compound determines the sequence of
base pairs to which the compound will bind in the minor groove.
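
The pairing rules just described can be written as a small lookup table. The following Python sketch is illustrative only: the names PAIR_TARGETS and targeted_base_pairs are hypothetical, only the three pairings stated above (Im/Py, Py/Im and Py/Py) are encoded, and the two residue lists are assumed to be given in the order in which the residues face each other in the minor groove.

```python
# Pairing rules as stated in the text (White et al., 1997).
# Keys are (residue on the first strand, residue on the antiparallel partner).
PAIR_TARGETS = {
    ("Im", "Py"): ("G-C",),
    ("Py", "Im"): ("C-G",),
    ("Py", "Py"): ("A-T", "T-A"),  # degenerate pairing
}


def targeted_base_pairs(first_strand, partner_strand):
    """Return, position by position, the base pairs a side-by-side dimer targets.

    Assumption of this sketch: both lists are ordered so that residue i of
    first_strand faces residue i of partner_strand.
    """
    if len(first_strand) != len(partner_strand):
        raise ValueError("both strands must have the same number of residues")
    targets = []
    for pair in zip(first_strand, partner_strand):
        if pair not in PAIR_TARGETS:
            raise ValueError(f"no rule stated in the text for pairing {pair}")
        targets.append(PAIR_TARGETS[pair])
    return targets


if __name__ == "__main__":
    # Im/Py, then Py/Py, then Py/Im
    print(targeted_base_pairs(["Im", "Py", "Py"], ["Py", "Py", "Im"]))
```

A pairing for which the text states no rule raises an error rather than guessing a target.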

Specificity (and affinity) of targeting increases as the
binding site size of the compound increases. Currently, however,
it is difficult to produce compounds that target a sequence that
is longer than 5-7 base pairs since, with increasing size, the
mismatch between these compounds and the DNA also increases.

For example, each pyrrole carboxamide contacts one AT base
pair. To enlarge binding site size and improve affinity, the
number of N-methylpyrrole units can therefore be increased.
However, for compounds containing more than six pyrroles this
prediction is no longer valid since the molecule becomes out of
phase with the base pairs along the minor groove floor. In fact,
the pyrrole-pyrrole distance is about 20% longer than required
for a perfect match (Goodsell and Dickerson, 1986). In addition,
compounds with five or more pyrrole rings are found to be
over-bent relative to the pitch of the DNA helix, resulting in
decreased binding affinities for longer oligopyrroles (de
Clairac et al., 1999).

To circumvent this mismatch problem, a flexible amino acid
(β-alanine) can be introduced in the center of the pyrrole ring
system to restore register of the recognition elements and relax
the curvature of these crescent-shaped molecules.



Attempts have been made to increase the size of the binding
sites of these DNA-binding molecules. For example, two netropsin
or two distamycin molecules have been joined together to form
dimers using a variety of different linkers to achieve binding
sites of 8 to 10 bases (Neamati et al., 1998; Wang and Lown,
1992). Furthermore, it has been proposed to tether together
polyamides of the hairpin type using β-alanine or 5-aminovaleric
acid (international patent application WO 98/45284). However,
none of these proposed structures has provided satisfactory
specificity.

It is an object of the present invention to provide DNA- 
binding molecules with high specificity and affinity for in vivo 
and in vitro use. 

Molecules meeting this objective, and which can be seen to be
highly improved tandem linked DNA-binding elements, have been
developed.

The inventors have established that the nature of the link 
between the DNA-binding elements (or "modules") and the relative
orientation of the linked elements are important factors in the 
proper functioning of each module. The characteristics which the 
linker must exhibit in order to achieve the above objective have 
been identified and are described below. 

The invention thus relates to tandem linked, highly
sequence-specific DNA-binding molecules.

More particularly, the invention concerns a DNA-binding
molecule, capable of sequence-specific binding to the minor
groove of double-stranded DNA, characterised in that it
comprises at least two sequence-specific DNA-binding elements,
covalently linked by an amphipathic, flexible linker molecule,
at least one of said DNA-binding elements being
non-proteinaceous.

In accordance with the invention, each DNA-binding element
alone may have relatively low specificity and affinity, but when
the elements are covalently linked to each other using an
amphipathic, flexible linker, a compound is obtained that by far
exceeds the specificity and affinity of the individual
DNA-binding elements.

The inventors have found that covalently linked
oligopyrroles in accordance with the invention efficiently
provide specificity for sequences as long as 15-18 base pairs.

The inventors have demonstrated the excellent specificity
and affinity of the compounds of the invention firstly by
targeting "SARs" (scaffold associated regions), which are
candidate cis-acting regions of chromosome dynamics. The
sequence hallmark of SARs is numerous AT-tracts (short motifs
of A and T bases) that are generally separated by short, mixed
sequence spacers, resulting in clustered AT-tracts (Adachi et
al., 1989; Bode et al., 1992; Kas and Laemmli, 1992).

This approach has also been further extended to target
sequences containing all four Watson-Crick base pairs by the use
of so-called tandem hairpin molecules that have little or no
base degeneracy, composed of predominantly heterocyclic building
blocks which are positioned opposite to each other with each
unit recognizing one single base.

According to the invention, the linker which links the
DNA-binding units is an amphipathic, flexible linker molecule.



In the context of the invention, amphipathic means that the
linker molecule has both polar and non-polar parts. The
non-polar part is water-insoluble and is thus hydrophobic (or
lipophilic) and soluble or miscible with non-polar solvents. The
polar part is water-soluble and is thus hydrophilic.

Steps in the interaction process of a DNA minor-groove
binding element involve a transfer from the aqueous solution
surrounding the DNA into the hydrophobic environment of the
minor groove. If the ligand is positively charged, counter ions
territorially bound to the DNA will be released. In the minor
groove, the element can form a variety of interactions,
including hydrogen bonds and van der Waals interactions.
Specificity of binding to a target sequence of the element in
the minor groove is based on molecular complementarity of the
recognition units of the moiety and the bases of the DNA target.

According to the invention, the tethering of said elements
with an amphipathic, flexible linker serves to promote the bi-
or multi-dentate energetically favourable interaction of the
multiple elements with the DNA strand. The amphipathic nature of
the linker increases the water solubility of the DNA-binding
molecule. This property of the linker enables unbound or
unfavourably bound elements to "escape" from the hydrophobic
environment of the minor groove into the aqueous solution
surrounding the DNA, to then reach DNA targets where specific
energetically favourable interactions can occur.

According to the invention, the linker is necessarily at
least bifunctional, i.e. it comprises at least two functional or
"reactive" groups through which the link between two tandemly
oriented DNA-binding elements is established. Preferably, but
not necessarily, the linker is heterobifunctional, meaning that
the linker molecule contains at least two different reactive
groups. These groups are usually, but not always, at the
extremities of the linker molecule.

Examples of suitable functional groups are amino, carboxyl,
thiol, haloacetyl, aldehyde, amino-oxy, maleimide groups, a
symmetrical anhydride and halogen atoms. Particularly preferred
are amino and carboxyl groups. In such a case, the C-terminus of
the linker is bound to the N-terminus of a first DNA-binding
element and the N-terminus of the linker is bound to the
C-terminus of the next DNA-binding element.

The DNA-binding elements are linked in a tandem manner, i.e.
consecutive DNA-binding elements are linked in the same
orientation with respect to each other, for example in a
head-to-tail configuration. In the case of DNA-binding elements
which have amino and carboxy termini, for example pseudopeptide
polyamide molecules, the amino terminus of a first DNA-binding
element is tethered via the linker to the carboxy terminus of a
second DNA-binding unit. The individual DNA-binding elements are
thus all oriented in the same direction, greatly facilitating
the binding of the molecule to the DNA. In the context of the
invention, "tandem" means in the same orientation, and
"inverted" means in opposite orientation.

The DNA-binding molecule thus binds in a multidentate mode
to a given strand of DNA. In other words, the DNA-binding
molecules of the invention are composed of DNA-binding
elements separated by linkers which are essentially devoid of
this capacity to bind the minor groove of DNA. All the elements
in a given DNA-binding molecule bind in tandem orientation to a
given strand of DNA. For DNA-binding molecules having amino and
carboxy termini, the binding to the DNA is normally in the
"parallel" orientation, i.e. the DNA-binding molecule binds in
an N->C direction parallel to the 5'->3' direction of the DNA.

The linker may or may not be involved in DNA interactions.
For example, the linker may contain positively charged groups
which interact with the phosphate backbone of the DNA. The
linker may also include a DNA-intercalating side group.
According to a preferred variant, the linker does not contain
any element which has DNA-binding properties.

The linker molecules used according to the invention are
preferably non-immunogenic and non-toxic and have increased
resistance to proteolytic degradation. They are preferably
non-self-aggregating, and do not have long stretches of
methylene groups, i.e. 3 or more methylene groups, thereby
reducing strong van der Waals interactions.

According to a preferred variant of the invention, the
linker in the general formula (I) below is represented by (L)m,
wherein "m" represents an integer having a value equal to, or
greater than, one. In particularly preferred variants, "m" has
the value 1. According to other preferred variants, "m" has a
value greater than 1, for example 2 to 10, or 3 to 8, and the
amphipathic linker (L)m thus comprises an assembly of linker
sub-units (L). In such a case, the assembled linker (L)m has an
overall amphipathic character, and at least one (L) sub-unit is
amphipathic. Preferably, more than one linker sub-unit, and most
preferably all linker sub-units, are individually amphipathic.

The total length of the linker (L)m is generally speaking
between 5 and 250 Angstroms, for example 5 to 50 Angstroms. This
corresponds to a length of approximately 4 to 42 interatomic
bonds. The number of linker sub-units (L) can be multiplied to
achieve a length corresponding to the number of DNA bases to be
spanned.
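
As a rough illustration of how the number of linker sub-units might be scaled to the stretch of DNA to be spanned, the arithmetic can be sketched in Python. This is a back-of-the-envelope estimate, not a design rule from the text. It assumes that the figure of about 42 bonds corresponds to the 50 Angstrom example, that each base pair adds about 3.4 Angstroms (the usual B-DNA rise, not a value given here), and that the number of backbone bonds per sub-unit is supplied by the user. The names subunits_needed and bonds_per_subunit are hypothetical.

```python
import math

# From the text: a linker of about 50 Angstroms is about 42 interatomic bonds
# (assumed reading of the "5 to 50 Angstroms" / "4 to 42 bonds" statement).
ANGSTROM_PER_BOND = 50 / 42

# Assumption of this sketch: standard B-DNA rise per base pair, in Angstroms.
RISE_PER_BASE_PAIR = 3.4


def subunits_needed(base_pairs, bonds_per_subunit):
    """Estimate how many linker sub-units span `base_pairs` base pairs.

    bonds_per_subunit is a hypothetical parameter: the number of backbone
    bonds contributed by one linker sub-unit (L).
    """
    if base_pairs <= 0 or bonds_per_subunit <= 0:
        raise ValueError("base_pairs and bonds_per_subunit must be positive")
    length = base_pairs * RISE_PER_BASE_PAIR   # Angstroms to be spanned
    bonds = length / ANGSTROM_PER_BOND         # equivalent number of bonds
    return math.ceil(bonds / bonds_per_subunit)


if __name__ == "__main__":
    # e.g. a sub-unit assumed to contribute 9 backbone bonds, spanning 6 base pairs
    print(subunits_needed(6, 9))
```

The estimate ignores the curvature of the minor groove and the flexibility of the linker, so it gives only an order of magnitude.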

Examples of suitable linkers are molecules comprising one or
more polar groups such as ether groups and/or ester groups, for
example molecules derived from ethylene oxide or propylene
oxide. Derivatives of ethylene oxide (CH2CH2O) are particularly
preferred, for example oligomers of ethylene oxide having
functional groups at the extremities. Such derivatives are
schematically represented by the following structure :

where the two terminal groups represent any functional groups,
for example those listed above, and may be the same or
different, and "n" may have a value from 1 to 20, for example
1 to 10, or 1 to 5.

Oligoglycine (NH-CH2-CO)p can also be used as an amphipathic
linker of the invention. A particularly preferred example is a
linker comprising one or more units of
8-amino-3,6-dioxaoctanoic acid.

The linker may also contain residues which are not directly 
involved in linking, for example residues for chain conversion 
such as glutamic acid or succinic anhydride.

At least one, and preferably all, of the DNA-binding elements
of the molecule of the invention are non-proteinaceous. In the
context of the invention, "non-proteinaceous" means that a given
DNA-binding element is composed, preferably but not necessarily
exclusively, of non-naturally-occurring amino acids.
Non-naturally-occurring amino acids are amino acids other than
those used by living cells to make proteins, for example organic
heterocyclic amino acids such as pyrrole, imidazole, triazole
etc.

The DNA-binding molecule of the invention thus comprises a
plurality of DNA-binding elements linked to each other with an
amphipathic linker. According to a preferred embodiment, at
least one of the DNA-binding elements of the molecule of the
invention comprises an oligomer containing one or more organic
heterocyclic amino acid residues. Such molecules are known as
"polyamide" DNA-binding molecules, or "pseudopeptides".

Particularly preferred organic heterocyclic amino acid
residues are those having at least one annular nitrogen, sulphur
or oxygen, such as pyrrole, imidazole, triazole, pyrazole,
furan, thiazole, thiophene, oxazole, pyridine. The organic
heterocyclic residues may also be derivatives of any of these
compounds wherein one or more of the heteroatoms are substituted
by a substituent which is DNA-binding or non-DNA-binding.
Examples of DNA-binding substituents are pyrrole, imidazole etc.
as listed above.

According to a particularly preferred embodiment, at least
one DNA-binding oligomer includes heterocyclic residues chosen
from N-methylpyrrole (Py) and/or 3-hydroxy N-methylpyrrole (Hp)
and/or N-methylimidazole (Im).

The DNA-binding element may further comprise at least one,
for example 2, 3 or 4, aliphatic amino acid residues such as a
β-alanine (β) residue or a 5-aminovaleric acid residue.
β-alanine is particularly preferred.

In a further preferred variant, the DNA-binding molecule of 
the invention has the general formula (I) : 

[Structural formula (I)]

wherein

- each of [P1] to [Pn] represents a DNA-binding element, said
  element comprising multiple organic heterocyclic or
  aliphatic residues or fluorescent derivatives thereof ;
- each of [R1] to [Rn] represents a DNA-binding element, said
  element comprising multiple organic heterocyclic or
  aliphatic residues or fluorescent derivatives thereof ;
- x represents an integer from 1 to 20, with the proviso
  that when x is greater than 1, the multiple copies of
  [Rn], [Ln], [Pn] and [Tn] may be the same or different,
  and may be the same as or different from [R1], [P1] and
  [T1] ;
- [T] represents a multifunctional linking molecule
  providing a covalent link between DNA-binding elements
  [R] and [P], with the proviso that if "e" represents 0,
  [Tn] can be bifunctional ;
- n is an integer equal to (x+1) ;
- each of a and c independently represents 0 or 1 ;
- each of b and d independently represents 0 or 1, with the
  proviso that when a represents 0, b also represents 0,
  and when c represents 0, d also represents 0 ;
- [D] represents an end group or an effector moiety ;
- (L)m represents an amphipathic, flexible linker molecule
  linking the DNA-binding elements in a tandem orientation
  with respect to each other ;
- m represents an integer from 1 to 10 ;
- [B] represents a spacer unit such as β-alanine ;
- [Z] represents an end group or an effector moiety ;
- each of f, g and e independently represents 0 or 1 ;
- each solid line represents a covalent bond ;
- N and C indicate the N- and C-terminal extremities of the
  molecule, respectively.
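
The numerical provisos attached to Formula (I) lend themselves to a simple consistency check. The Python sketch below is only an illustration of those provisos: the class name FormulaIIndices and its method violations are hypothetical, and the range of "m" is taken as 1 to 10, which is an assumed reading of the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FormulaIIndices:
    """Integer indices of Formula (I); names follow the symbols in the text."""
    x: int
    m: int
    a: int = 0
    b: int = 0
    c: int = 0
    d: int = 0
    e: int = 0
    f: int = 0
    g: int = 0

    @property
    def n(self):
        # n is defined in the text as an integer equal to (x+1)
        return self.x + 1

    def violations(self):
        """Return a list of provisos from the text that these indices break."""
        problems = []
        if not 1 <= self.x <= 20:
            problems.append("x must be an integer from 1 to 20")
        if not 1 <= self.m <= 10:  # assumed reading of the range for m
            problems.append("m must be an integer from 1 to 10")
        for name in "abcdefg":
            if getattr(self, name) not in (0, 1):
                problems.append(f"{name} must be 0 or 1")
        if self.a == 0 and self.b != 0:
            problems.append("when a is 0, b must also be 0")
        if self.c == 0 and self.d != 0:
            problems.append("when c is 0, d must also be 0")
        return problems


if __name__ == "__main__":
    dimer = FormulaIIndices(x=1, m=1)  # a = b = c = d = 0
    print(dimer.n, dimer.violations())
```

Returning a list of violations, rather than raising on the first one, makes it easy to report every broken proviso at once.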

In the above formula (I), the DNA-binding elements are
represented by [R1], [P1], [Rn] and [Pn]. When [T1] and/or [Tn]
is present, the covalently linked unit of [R1], [P1] and [T1] is
considered as a DNA-binding unit, and [Rn], [Pn] and [Tn] is
also a DNA-binding unit.

In the formulae of the present invention, an element
represented in square brackets with a sub-script outside the
square brackets, for example "[R]n", indicates multiple copies
of the element, which, unless otherwise indicated, may be the
same as each other or different from each other, the number of
multiples being equal to the value of the subscript. An element
represented in square brackets with a super-script inside the
square brackets, for example "[Rn]", indicates the "nth" copy of
that element, the first to the nth copy being the same as each
other or different from each other.

The DNA-binding elements [P] and [R] in Formula (I)
preferably comprise heterocyclic residues chosen from pyrrole,
imidazole, triazole, pyrazole, furan, thiazole, thiophene,
oxazole, pyridine, or derivatives of any of these compounds
wherein one or more of the heteroatoms is substituted. The
substituents may be DNA-binding or non-DNA-binding.

In a particularly preferred embodiment, a, b, c and d in
Formula (I) represent 0, that is, the [T] and [R] moieties
are absent. Such a molecule will be referred to herein as a
"linear" DNA-binding molecule. Generally such linear molecules
have the general formula (II) :

[Structural formula (II)]

wherein [P1], [Pn], (L), [D], [Z], x, m, f, g and e have the
previously defined meanings,
and a dotted line represents a covalent bond which can be
present or absent.

"Linear" polyamides, referred to herein, are polyamides
composed of a single N -> C strand of amino acid residues. Such
linear molecules can bind DNA either as a single molecule in a
1:1 binding mode, or in a 2:1 binding mode, wherein two linear
molecules align in an anti-parallel manner in the minor groove,
forming binding pairs between the residues of the first molecule
and those of the second molecule.

In Formula (II), each of the DNA-binding elements [P1]
to [Pn] preferably independently has the general formula (III) :

[U] - [U]s .....          (III)

wherein :

each U is a monomeric unit chosen from a heterocyclic amino
acid residue, an aliphatic amino acid residue or a
fluorescent derivative thereof,

s is an integer from 1 to 15, preferably from 2 to 8,

and a dotted line represents a covalent bond which may be
present or absent.

The linear DNA-binding molecules of the invention preferably
have at least one [U] moiety chosen from N-methylpyrrole (Py)
and/or 3-hydroxy N-methylpyrrole (Hp) and/or
N-methylimidazole (Im).

Furthermore, they may also contain at least one β-alanine
(β) residue, or a 5-aminovaleric acid residue.

In Formula (III), the value of s is preferably from 2 to 8,
for example 2 to 6, or 3 to 4.

At least one of the elements [P1] to [Pn] of the linear
DNA-binding molecules may comprise between 3 and 5 heterocyclic
amino acid residues, for example 4. Of these, two or more may be
contiguous, for example three, four or five contiguous
heterocyclic amino acid residues. Preferably, stretches of three
to five contiguous heterocyclic amino acid residues are
separated from each other by a β-alanine residue.

Particularly preferred linear molecules comprise at least
one [P1] to [Pn] element having the formula (IV) :

N                                                             C
[U1] - [U2] - [U3] - [U4] - [U5] - [U6] - [U7] - [U8] .....

(IV)

wherein U is as previously defined,
[U4] is β-alanine,
[U1] to [U3], and [U5] to [U7], are chosen from N-methylpyrrole
(Py) and/or N-methylimidazole (Im),
[U8] may be present or absent, and if present is
preferentially β-alanine,
and a dotted line represents a covalent bond which may be
present or absent.

In Formula (IV), [U1] to [U3], and [U5] to [U7], may each be
N-methylpyrrole (Py).

In another preferred embodiment, at least one of [P1] to [Pn]
of the DNA-binding molecule has the formula (V) :

[Structural formula (V)]

wherein :

- U is as previously defined,
- [U1] to [U8] are chosen from N-methylpyrrole (Py),
  N-methylimidazole (Im) and a β-alanine residue,
  with the proviso that the [U] immediately adjacent, on
  the N-terminal side, to each Im is a β-alanine residue,
- [U9] may be present or absent, and if present is
  preferentially β-alanine,
- and a dotted line represents a covalent bond which may be
  present or absent.

An example of such a [P1] to [Pn] element has the formula (VI) :

N                              C
Im-β-Im-Py-β-Im-β-Im-β .....

(VI)


According to a preferred embodiment, the number of repeat
DNA-binding elements contained within a linear molecule, i.e.
the value of "x" in Formula (II), is from 2 to 10, for example
2, 3, 4, 5, 6, 7, 8, 9 or 10.

In linear molecules of Formula (I), the DNA-binding units [P1]
and [Pn] are linked in the same molecular orientation (i.e. in
tandem) by the linker (L).

In addition to the linear molecules, the invention also
relates to branched molecules, for example "hairpin"
molecules. Such branched molecules generally have the general
formula (VII) :

[Structural formula (VII)]

wherein [R1], [P1], [Rn], [Pn], [T1], [Tn], (L), [D], [B],
[Z], m, n, g, f and e have the previously defined meanings.

In Formula (VII), each of the DNA-binding elements [P1] to
[Pn] and [R1] to [Rn] may independently have the formula (VIII) :

[U] - [U]s .....          (VIII)

wherein :

each U is a monomeric unit chosen from a heterocyclic amino
acid residue, an aliphatic amino acid residue or a
fluorescent derivative of the foregoing,

s is an integer from 0 to 15, preferably from 1 to 6,

and a dotted line represents a covalent bond which may be
present or absent.

The branched molecules, such as hairpin polyamides,
preferably contain at least one heterocyclic amino acid residue
comprising an annular nitrogen. More specifically, at least one
of [P1] to [Pn] or [R1] to [Rn] preferably contains a residue of
N-methylpyrrole (Py) and/or 3-hydroxy N-methylpyrrole (Hp)
and/or N-methylimidazole (Im). [P1] to [Pn] or [R1] to [Rn]
advantageously further contain an aliphatic amino acid residue
such as a β-alanine (β) residue.

In Formula (VIII), "s" is an integer from 0 to 15, preferably
1 to 6, for example 3, 4 or 5.

The branched molecules of Formula (VII) comprise a moiety [T]
which serves to covalently link the upper DNA-binding element
[R] with the lower DNA-binding element [P]. [T] may be any
molecule suitable for providing this link, and may or may not
have DNA-binding properties. [T] may be positioned between any
residues in the upper strand and lower strand. [T] is at least
bifunctional in order to allow the linkage of the two strands of
the molecule. [T] may however also have more functional groups,
being for example trifunctional. This allows addition of any
further moieties, such as effector moieties, if desired at this
site. The functional groups of [T] are, for example, chosen from
amino, carboxyl, thiol, haloacetyl, aldehyde, amino-oxy,
maleimide groups, a symmetrical anhydride and halogen atoms, but
can also include any other suitable groups.

A preferred example of [T] is a "turn" molecule derived
from an amino acid, giving rise to a "U"-shaped molecule, such
as a hairpin polyamide. According to this variant, [T] is chosen
for example from γ-aminobutyric acid or diaminobutyric acid or
an amino acid with a side group, or any other molecule having at
least 3 reactive groups, or a fluorescent derivative of the
foregoing. If "e" in Formula (VII) represents 0, [Tn] can be
bifunctional, for example γ-aminobutyric acid. Other suitable
[T] linkers include "H-pins".

According to the hairpin variant of the invention, a first
DNA-binding unit composed of [P1], [T1] and [R1] is linked in
tandem via the linker to a second DNA-binding unit composed of
[Pn], [Tn] and [Rn].

At least one of the elements [P1] to [Pn] of the hairpin
DNA-binding molecules may comprise between 3 and 5 heterocyclic
amino acid residues, for example 4. Of these, two or more may be
contiguous, for example three, four or five contiguous
heterocyclic amino acid residues. Preferably, stretches of three
to five contiguous heterocyclic amino acid residues are
separated from each other by a β-alanine residue.

The invention also concerns hairpin DNA-binding molecules
wherein at least one [Pn] element has the formula (IX) :

[Structural formula (IX)]

and at least one [Rn] element has the formula (X) :

[Structural formula (X)]

wherein each U represents independently N-methylpyrrole
(Py), or 3-hydroxy N-methylpyrrole (Hp), or N-methylimidazole
(Im), or N-methylpyrazole (Pz), or 3-pyrazolecarboxylic acid
(3-Pz), or β-alanine (β), q and s are independently integers
from 1 to 10,

and a dotted line represents a covalent bond which can be
present or absent,

wherein the U residues of [Pn] form anti-parallel pairs with
the U residues of [Rn],

said pairs being chosen from Py/Im, Im/Py, Py/Py, Hp/Py,
Py/Hp, β/Py, Py/β, β/Im, Im/β, Im/Im, Pz/Py, 3-Pz/Pz, and
In the formulae (I), (II) and (VII), [Z] may be any end
group or an effector moiety, for example any conjugated chemical
group such as an affinity tag, a fluorescent label, a peptide, a
reactive group, or a toxin. The DNA-binding molecules can
therefore be used to target effector molecules intracellularly.








Similarly, in the above Formulae, [D] represents an end
group such as dimethylaminopropylamide, 3-aminopropylamine-N-
methyl N-propylamide, or a fluorescent derivative thereof.
Alternatively, [D] may comprise an effector moiety.

Indeed, according to a particularly preferred variant of the
invention, the DNA-binding molecules comprise an effector
moiety. In view of the excellent affinity and specificity which
these cell-permeable molecules show for their DNA targets, they
can be used to deliver a large number of different types of
compounds to the nucleic acids and cellular compartments in
question.

The "effector moiety" is any chemical group or molecule
which mediates a function other than, or in addition to,
sequence-specific recognition of DNA in the minor groove. For
example, the effector moiety may be a peptide, a fluorescent
label, a reactive group, a toxin or an affinity tag.

The effector moiety can be linked to the molecule at any
suitable site, preferably by a covalent bond, for example to any
of the heterocyclic or aliphatic amino acid residues, or to the
carboxy or amino termini, or to the [T] or [L] moieties. In
formulae (I) and (VII), particularly preferred sites for linkage
of the effector moiety are represented by [D] and/or [Z]. Another
particularly preferred site for linkage of an effector moiety
is a pyrrole residue.

The effector moiety is capable of carrying out at least one
of the following functions : visual detection, nucleic acid
cleavage, binding to the major groove of nucleic acid,
inhibition of binding to the major groove of nucleic acid,
protein binding, inhibition of protein binding, chemical
modification of DNA, distortion of DNA structure.

Particularly preferred effector moieties include a
fluorescent moiety, an alkylating moiety, an intercalating
moiety, nucleotides and derivatives thereof, or combinations of
any of the foregoing. As particular examples, one can cite
antisense oligonucleotides or ribozymes ; isothiazolone
derivatives ; acridine or derivatives thereof ; porphyrins ;
cisplatin or derivatives thereof ; anthracyclines or derivatives
thereof. Illustrative embodiments of effector moieties are
indicated in the examples below.

The invention also relates to mixed linear and hairpin
molecules in which at least one DNA-binding sub-unit is linear
and at least one is hairpin. In the general formula (I), these
molecules have at least one DNA-binding element containing [T],
[R] and [P] moieties, and at least one DNA-binding element which
is free of [T] and [R] moieties.

The multiple [R] and [P] elements of the molecules, whether 
linear, hairpin or mixed, may all be identical, or alternatively 
may differ in length and / or composition. 

According to a particularly preferred embodiment, DNA-binding
molecules of formula (I) have "x" equal to 1, 2, 3, 4 or 5, "s"
equal to 3 or 4, "n" equal to 2 or 3, "e" equal to 1 or 0, "g"
equal to 1 or 0 and "f" equal to 1 or 0. Molecules having x
equal to 1 are particularly preferred. Such molecules are
dimers, and may be homo- or heterodimers.

The molecules of the invention have exceptional DNA-sequence
specificity. Preferably, they have the capacity to bind in a
sequence-specific manner to a DNA recognition sequence of at
least 6, preferably at least 10 and most preferably at least 14
base pairs in length. In the context of the invention, sequence
specificity in vivo means that the normal functions of the cell,
other than those mediated by the targeted sequence, are not
perturbed by the molecule. The molecule therefore acts on its
target without causing effects which the cell could not
tolerate, over and above the sought effect.

A further advantage of the molecules of the invention is
that they are small, that is, they preferably have a molecular
weight no greater than approximately 8 kDa, for example less
than 6 kDa or less than 5 kDa, particularly between 1 kDa and
5 kDa. These molecules are cell-permeable, greatly facilitating
their administration as drugs etc. The cell-permeability is
usually conserved even when one or more effector moieties are
included in the DNA-binding molecules. As the size of the
molecules increases, permeability may become less, and it is
therefore advantageous to carry out any necessary chemical
modification of the compound to conserve or restore cell
permeability. This can be done for example by chemical
modification of one or more of the heterocyclic amino acid
residues. Cell permeability and/or solubility of the compound
can be modulated in this manner. The chemical modification
typically comprises the addition of a polar side chain, for
example a propylamine side chain, or a bulky side chain to a
pyrrole residue.

A further modification which could be made to enhance
permeability and/or solubility is the addition of a charged
amino acid such as histidine, arginine or lysine.

A particular advantage of the DNA-binding compounds of the
invention, resulting from the use of the amphipathic linker,
particularly the derivatives of ethylene oxide, is the enhanced
solubility of the compounds in aqueous media compared to
polyamide multimers containing hydrophobic linkers. Indeed, the
amphipathic nature of the linker confers a degree of hydrophilic
character on the molecule, giving rise to an adequate solubility
in aqueous solutions such as cell culture media or physiological
solutions. The tandem-linked molecules of the invention do not
precipitate out (i.e. do not form crystals) in cell culture, in
contrast to multimers linked with conventional hydrophobic
linkers such as 5-amino valeric acid. It has been demonstrated
by the inventors that the molecules of the invention conserve
solubility even after addition of a hydrophobic effector moiety
such as an alkylating group (e.g. chlorambucil). This
characteristic facilitates use of the linked polyamides as
agents for delivery of effector moieties to intracellular
compartments. The solubility of the compounds of the invention
can be verified using the assay indicated in Example 10 below.

The DNA-binding molecules of the invention also exhibit
exceptional binding affinity, for example an apparent binding
affinity of at least 5 x 10^ M'^, preferably at least 1 x 10^ M"^ 
and most preferably at least 5 x 10"**° M"^' 

The invention also relates to a process for binding
double-stranded DNA in a sequence-specific manner, comprising
contacting a DNA target sequence within said DNA with a
DNA-binding molecule according to the invention, in conditions
allowing said binding to occur. The molecules used in this
process may be hairpin, linear or mixed.

The process may be carried out in vivo, in vitro or ex vivo.
In vivo processes are particularly preferred.

When the process of binding is carried out in a cell, the
cell may be eukaryotic or prokaryotic. Eukaryotic cells are
particularly preferred, for example vertebrate cells,
invertebrate cells, plant cells, mammalian cells, insect cells,
or yeast cells.

The double-stranded target DNA may be endogenous to the cell
or it may be heterologous to said cell.

The target is preferably a chromatin element, for example a
SAR-like sequence, or a GAGAA repeat sequence.

For intracellular use, the target sequence preferably has at 
least 6 or 8 and preferably at least 10 or at least 12 or 15 
bases. High specificity is thus achieved within the cell. 

The target sequence is preferably a cis- or trans-acting
element mediating chromosome function. Use of the tandem-linked
molecules of the invention to target such a sequence gives rise
to cis- and/or trans-regulation of chromosome function.

The double-stranded DNA target sequence may also comprise a
site mediating the activity of one or more regulatory factors,
for example transcription regulatory factors, DNA replication
factors, factors for enzymatic activity, or factors involved in
chromosome stability.

DNA-binding molecules of the invention can be designed to
target many DNA sequences using the pairing rules known in the
art. Table 1 below provides examples of the binding preferences
of frequently used residues. Sequence-specific effects normally
influence the precise binding behaviour of some heterocycles.
Table 1 therefore provides general guidelines which can be
adapted, if necessary, to fit particular situations.

Table 2 shows residue pairs which can be substituted for
other pairs.

The composition of the DNA-binding molecule is chosen as a
function of the sequence of the targeted DNA, on the basis of
pairing rules known in the art, for example as indicated in
Tables 1 and 2. For linked polyamides, particularly hairpins,
containing a number 'n' of amino acid residues, the target
sequence usually comprises n+3 bases.




TABLE 1: Guidelines for binding preferences

Residue or pair of residues         DNA binding preference
----------------------------------  ------------------------------------
Im/Py                               G-C
Py/Im                               C-G
Py/Py                               A-T and T-A
Hp/Py                               T-A
Py/Hp                               A-T
β (preceded on C-terminal           A-T or T-A (flanking core sequence)
  side by Dp)
Pz/Py                               A-T or T-A
3-Pz/Py                             G-C
β/β                                 T-A or A-T
β/Py                                T-A or A-T
Py/β                                T-A or A-T
Im/β                                G-C
β/Im                                C-G
Unpaired Im (internal, not          G or C, but tolerated by Ws,
  N-terminal, in a                    particularly if preceded by an
  single-stranded molecule)           N-terminal pyrrole, but less well
                                      if preceded (N-terminal) by a β
Dp (C-terminal)                     W
Unpaired Py                         A-T or T-A
Unpaired Hp                         A-T
Unpaired Im (in unpaired            Preferably G or C
  overhang of a linked
  molecule of invention)
γ                                   Optimally positioned on a W
Ethylene oxide linker               Optimally bridges W (but can loop
                                      out, opposing no nucleotide)


Table 2: Substitution of binding pairs by Hp-containing pairs

Residue or pair of residues         Possible substitutions
----------------------------------  ------------------------------------
Im/β                                Im/Py
β/Im                                Py/Im
Py/β                                Hp/Py or Py/Hp
β/Py                                Hp/Py or Py/Hp
Hp/β                                Hp/Py
β/Hp                                Py/Hp
β/β                                 Hp/Py or Py/Hp


Legend of Tables 1 and 2

Im : N-methyl imidazole
Py : N-methyl pyrrole
Hp : N-methyl hydroxypyrrole
Dp : C-terminal dimethylaminopropylamide
β : β-alanine
Pz : N-methyl pyrazole
3-Pz : 3-pyrazolecarboxylic acid
γ : γ-aminobutyric acid (or diaminobutyric acid)
W : A or T
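
By way of illustration only, the pairing guidelines for the most frequently used pairs can be written as a simple lookup. The following Python sketch is not part of the described method: the names PAIR_TO_BASE, target_pattern and matches_site are hypothetical, only five pairs are transcribed, and it is assumed that each pair reads the first-listed base of its preferred base pair on one strand. The context-dependent effects mentioned above are ignored.

```python
import re

# Guidelines transcribed for five common pairs (first-listed base of the
# preferred base pair). "W" means A or T, as in the table legend.
# Illustrative assumption: sequence-context effects are ignored.
PAIR_TO_BASE = {
    ("Im", "Py"): "G",  # Im/Py -> G-C
    ("Py", "Im"): "C",  # Py/Im -> C-G
    ("Py", "Py"): "W",  # Py/Py -> A-T and T-A
    ("Hp", "Py"): "T",  # Hp/Py -> T-A
    ("Py", "Hp"): "A",  # Py/Hp -> A-T
}


def target_pattern(pairs):
    """Return the one-strand pattern preferred by a list of residue pairs."""
    return "".join(PAIR_TO_BASE[pair] for pair in pairs)


def matches_site(pairs, site):
    """Check whether a DNA site fits the pattern, treating W as A or T."""
    regex = target_pattern(pairs).replace("W", "[AT]")
    return re.fullmatch(regex, site.upper()) is not None


pairs = [("Im", "Py"), ("Py", "Py"), ("Py", "Py"), ("Py", "Im")]
print(target_pattern(pairs))        # GWWC
print(matches_site(pairs, "GATC"))  # True
```

Pairs not listed in the dictionary raise a KeyError, which is deliberate in this sketch: the β-, Pz- and γ-containing entries carry positional conditions that a flat lookup cannot express.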



The invention also relates to a process for modulating
chromosome function in a eukaryotic cell, comprising the step of
contacting a genomic DNA element comprising a binding site
mediating chromosome function, with a tandem-linked DNA-binding
molecule of the invention and having the capacity to bind in a
sequence-specific manner to said element, said step of
contacting being carried out in conditions permitting binding of
said compound to said element, wherein the binding modulates
chromosome function.

The invention further relates to a process for modulating
the function of a DNA element in a eukaryotic cell, comprising
the step of contacting a genomic DNA element, so-called
« chromatin responsive element » (CRE), with a tandem-linked
DNA-binding molecule of the invention and having the capacity to
bind in a sequence-specific manner to said CRE, said step of
contacting being carried out in conditions permitting chromatin
remodelling of the CRE by said compound, wherein said chromatin
remodelling of the CRE alters the activity of one or more other
DNA elements, so-called « modulated DNA elements », in the
genome.

Non-human organisms comprising the cells of the invention 
are also comprised within the invention, for example a non-human 
animal, which may be a transgenic, non-human animal, or a plant 
including a transgenic plant. 

The invention also relates to a pharmaceutical composition
comprising a DNA-binding compound of the invention in
association with a physiologically acceptable excipient,
carrier, adjuvant, stabilizer or vehicle. The composition may be
administered orally, sub-cutaneously, topically, rectally,
intravenously, intramuscularly or by inhalation spray.

The compounds and compositions of the invention may be used 
in therapy, particularly in the treatment of disorders of 
genetic origin. 



The compounds and compositions of the invention may be
fluorescent or fluorescently labelled. The fluorescent label may
be a fluorescent dye such as fluorescein, dansyl, Texas red,
isosulfan blue, ethyl red, malachite green, rhodamine and
cyanine dyes.

The fluorescent compounds can be used for probing the
epigenetic state and location of DNA in chromosomes and nuclei,
for chromosome visualisation and marking in diagnosis, forensic
studies, affiliation studies, or animal husbandry.



Figure legends

Figure 1: Chemical structure of the oligopyrrole monomers and
dimers.

The structures of the dimers Lex18 and Lex10 are shown. Both
dimers are composed of the same oligopyrrole monomers (P7 and P9)
joined by either a short (Lex18) or a long (Lex10) linker. The
linker of Lex10 contains three ethylene oxide spacer amino acids
(AO) and Lex18 only one. The flexible linker allows bidentate
binding of both oligopyrrole moieties to long or bipartite
AT-tracts of 15-18 bases. Amino- and carboxyl termini are marked
with N and C respectively.

Figure 2: DNase I footprint assays with P9, P7, P13, Lex9 and
Lex10.

DNase I cleavage pattern in the presence of P9, P7, P13, and
dimers Lex9 and Lex10. Ligand concentrations are indicated at the
top of each lane. The position of each of the AT-tracts is
indicated by square brackets. Panel A shows the footprints of
monomers P9 and P7 on probe W9. This probe is composed of
head-to-tail tandem repeats of an oligonucleotide with a 9 bp
AT-tract. Panel B shows the footprint of P13 on a probe with one
single W9 insert at the indicated position. Panel C shows the
DNase I cleavage pattern of the same probe as in panel B in the
presence of Lex9 and Lex10. Ligand concentrations are again
indicated at the top of each lane (in nM). The position of each of
the AT-tracts is indicated by square brackets. Kapps (apparent
dissociation constants) are listed in Table (3). Note that P13
(but not dimers Lex9 and 10) was found to be very GC-tolerant,
since its footprint expanded rapidly at increasing ligand
concentration from W9 into the flanking mixed sequences to
eventually protect (coat) the entire probe.

Figure 3: Binding of Lex10 and Lex18 to SAR

Panel A: DNase I cleavage pattern of end-labeled SAR probe
in the presence of Lex10. Ligand concentrations (nM) are
indicated at the top of each lane. The position of each of the
AT-tracts is indicated by square brackets. Panel B shows the
affinity cleavage reaction by Lex18E on the SAR probe (same
probe as in panel A). Panel C: DNase I footprinting experiment
with P31 and affinity cleavage with P31E are shown on the GAF31
and the Brown I probes. The GAF31 probe contains a (AAGAG)2 motif
and a GAGA factor (GAF) binding site from the Ubx promoter
(Biggin et al., 1988). Note that P31 does not bind the typical GAF
binding site (Ubx). The Brown I oligo (a tandem repeat) includes
an (AAGAG)5 binding site and a degenerate P31 binding site
(AACAC)2 as indicated. P31 concentrations used (nM) are
indicated. Lanes labeled P31E (top) are affinity cleavage
reactions with 1 nM of P31E on either probe. Binding orientations
of P31E on these probes are indicated by arrowheads on the
brackets pointing towards the N-terminus of the molecule. The
letter G refers to the G nucleotide cleavage reaction. Panel D
shows the sequence of this SAR probe and the positions of the
major AT-tracts. Protected regions are indicated with boxes. The
vertical arrows reflect the affinity cleavage site and
approximate strength.



Panel E shows a binding model of dimers Lex10 and Lex18 on the
W17 tract (see panel C) of the SAR probe (top).

Figure 4: Staining of Drosophila nuclei and polytene chromosomes
with fluorescently tagged oligopyrroles.

Isolated Kc nuclei were stained with ethidium bromide and
fluorescein-tagged oligopyrroles as indicated. Note that P9 (panel
A) and Lex9F (panel B) highlight satellites I and III as intense
green foci and that the general nucleoplasmic staining of P9 is
more pronounced than that of dimer Lex9F. This is best seen in the
gray scale insert (panel C), where the total DNA signal (EB) and
the Lex9F or P9F signals are shown separately. Note that the
nuclear subregion stained intensely with ethidium bromide
represents the nucleolus. Panel D shows a single polytene
chromosome stained with ethidium bromide (red) and Lex9F. The two
major signals of Lex9F abutting the chromocenter on chromosomes IV
and IIIR represent satellite I (indicated). Other important Lex9F
signals appearing yellow are in the arm of chromosome 4 and within
the chromocenter. This latter signal may represent the
under-replicated SAR-like sequence satellite III (indicated).
Panel E shows the transverse striations of Lex9F in green (overlap
yellow), which are thought to reflect the positions of SARs along
the euchromatic arms of polytene chromosomes. The red signal of
ethidium bromide shows the classic banding pattern. For panel E,
colors were not blended additively as above but by using color
priority, where the pixel values of higher priority wavelengths
are subtracted from the lower priority wavelength. This reduces
color mixing, rendering the more subtle variation of green and red
more visible. Micrographs were recorded on a DeltaVision
epifluorescence microscopy system.

Figure 5: Binding specificity of P31 and GAGA factor

Panel A: DNase I footprinting experiment with P31 and
affinity cleavage with P31E are shown on the GAF31 and the Brown I
probes. The GAF31 probe contains a (AAGAG)2 motif and a GAGA
factor (GAF) binding site from the Ubx promoter (Biggin
et al., 1988). Note that P31 does not bind the typical GAF
binding site (Ubx). The Brown I oligo (a tandem repeat) includes
an (AAGAG)5 binding site and a degenerate P31 binding site
(AACAC)2 as indicated. P31 concentrations used (nM) are
indicated. Lanes labeled P31E (top) are affinity cleavage
reactions with 1 nM of P31E on either probe. Binding orientations
of P31E on these probes are indicated by arrowheads on the
brackets pointing towards the N-terminus of the molecule. The
letter G refers to the G nucleotide cleavage reaction. Panel B:
DNase I footprinting experiment with purified GAGA factor (GAF)
on the GAF31 probe. Note that GAF binds both the (AAGAG)2 motif
and the binding site from the Ubx promoter.

Figure 6: The fluorescent polyamide P31T specifically highlights
the GAGAA satellite V

Isolated Kc nuclei and polytene chromosomes were stained
with DAPI (blue), P31T (Texas red-labeled P31) and Lex9F
(fluorescein-tagged Lex9). Panel A: The green P9F foci are
proposed to highlight satellites I and III. P31T marks the
separate positions of the GAGAA satellites. Panels B & C: The
black and white panels display the red and green channels of panel
A, respectively. Panel D: Staining of brown-dominant polytene
chromosomes with DAPI, P31T and Lex9F. The polytene banding
pattern is shown in blue (DAPI). P31T highlights in red the
heterochromatic GAGAA repeats of the allele bwD at 59E. Lex9F
(green) highlights in polytene chromosomes the position of
satellite I at the base of chromosomes 4 and 3R abutting the
chromocenter (Figure 5).



Figure 7: Oligopyrrole monomers induced chromatin opening of
satellite III.

Kc nuclei were incubated with mitotic Xenopus egg extracts
in the presence of the various polyamides and then further treated
with VM26 to accumulate the so-called cleavable complexes of
topoisomerase II. Cleavage in Drosophila satellite III was
revealed by Southern blotting. Satellite III contains a major
topoisomerase II cleavage site once per 359-bp repeat. The extent
of the cleavage activity is reflected by the development of the
ladder of multimers of the basic repeat. All panels included
controls with (+) and lanes without VM26 (-). Panel A shows the
massive activation of cleavage (chromatin opening) mediated by P9
and the reduced activity of P31 in this assay. Panel B: In
contrast to monomer P9, no cleavage stimulation but abrupt
inhibition is observed with Lex10. A much reduced cleavage
stimulation is also observed with Lex9. Panel C demonstrates that
the general fragmentation of the genome by topoisomerase II is not
inhibited by Lex10 and Lex9. DNA was separated by pulse-field
electrophoresis and then probed with total Kc DNA under conditions
that suppress hybridization to repeat DNA. Duplicate samples were
loaded.

Figure 8: Specific inhibition of chromosome assembly by Lex10

Panel A: The effect of Lex9 and Lex10 on the condensation of
sperm nuclei to chromatids was studied in mitotic Xenopus egg
extracts. Representative micrographs of the assembly products
stained with ethidium bromide are shown. Ligand concentrations are
as indicated.

Panel B: Condensation was inhibited by Lex10 (1 µM) or
monomer P9 (2 µM). Competing oligonucleotides were then added to
evaluate the specificity of the inhibition. The condensation block
by Lex10 could be reversed by the addition of SAR oligo but not
with the W9 or GAGA oligo, which bind Lex10 poorly (doses are
indicated in ng). In contrast, the P9-mediated inhibition appears
non-specific and could not be reversed by an excess of competitor
oligonucleotide W9.

Figure 9: Structure of compounds P50 and P51

Figure 10:

Binding affinity (Ka) of linked oligopyrroles (monomer,
dimer, trimer) versus binding site size. In the top panel, it
can be observed that the oligopyrrole trimer P49, designed to
bind 18 Ws (A or T base pairs), has maximum binding affinity on
W18. Specificity for these sequences is due to the much lower
binding strength on shorter AT-tracts. For example, the binding
affinity of P49 on W9 (Kd≈150 nM) is ~300 fold lower than on W18
(Kd=0.75 nM).

Figure 11:

Structure of compound P52. This compound is designed to
bind the 10 bp sequence 5'-GGTTAGGTTA-3'. A single base pair
insertion or deletion in the middle of this sequence was shown to
abolish binding.

Figure 12:

DNase footprinting experiment of P52 for 5'-GGTTAGGTTA-3'.

Figure 13:

The structures of differently linked tandem hairpin polyamides,
conjugated with a hydrophobic effector moiety (chlorambucil).

Figure 14:

HPLC chromatograms showing superposed profiles for soluble and
insoluble fractions (supernatant and pellet respectively). Panel
(A) shows the profiles for the valeric acid linked tandem
hairpin (Figure 13, bottom). Panel (B) shows the profile for the
tandem hairpin with the amphipathic linker of the invention
(Figure 13, top). The more hydrophilic compound (panel B) also
eluted earlier during the same HPLC gradient.



Materials and Methods employed in the following examples are
indicated collectively in Example 11 below.

1 - Synthesis of oligopyrroles for targeting AT-tracts

To explore the biological potential of polyamides, compounds
that target DNA satellites I, III, V and the interspersed SAR
elements were synthesized. Satellite I (1.672 density) consists of
AATAT units encompassing about 6 megabases (Mb). Satellite V (1.705
density) is composed of AAGAG repeats amounting to about 7 Mb
(Lohe et al., 1993). Satellite III (1.688 density) has a much
longer repeating unit (359 bp) and covers about 10 Mb (Hsieh and
Brutlag, 1979). Satellite III repeats behave operationally like
SARs (Kas and Laemmli, 1992), the sequence hallmarks of which are
numerous clustered AT-tracts. For example, the SAR associated
with the Drosophila histone gene cluster is defined by a 656 bp
EcoRI/HinfI fragment containing 26 AT-tracts of 8 or more Ws (A or
T bases) with an average length of 10 base pairs (Gasser and
Laemmli, 1986; Mirkovitch et al., 1984). Twenty of these AT-tracts
are clustered and separated by a spacer of only a few nucleotides
(average 4.5) of mixed base pair sequence.
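
The definition of an AT-tract used here (a run of 8 or more Ws) lends itself to a simple scan of a sequence. The following Python sketch is purely illustrative: the function name find_at_tracts and the toy sequence are assumptions, and it is not the procedure by which the tracts of the histone SAR were originally mapped.

```python
import re


def find_at_tracts(sequence, min_length=8):
    """Return (start, length) for each maximal run of A/T of at least
    min_length bases; start is a 0-based index into the sequence."""
    pattern = re.compile(r"[AT]{%d,}" % min_length)
    return [(m.start(), len(m.group()))
            for m in pattern.finditer(sequence.upper())]


def spacer_lengths(tracts):
    """Lengths of the mixed-sequence spacers between consecutive tracts."""
    return [nxt_start - (start + length)
            for (start, length), (nxt_start, _) in zip(tracts, tracts[1:])]


# Toy sequence (hypothetical): a 9-W tract, a 4-base spacer, a 10-W tract.
toy = "GC" + "A" * 9 + "GCGC" + "AT" * 5 + "GG"
tracts = find_at_tracts(toy)
print(tracts)                  # [(2, 9), (15, 10)]
print(spacer_lengths(tracts))  # [4]
```

Runs shorter than min_length are treated as part of the spacer, which matches the threshold of 8 Ws quoted above but is otherwise an arbitrary choice.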

The minor groove of AT-tracts can be targeted by the
naturally occurring antibiotics distamycin A and netropsin, as
well as by synthetic molecules that contain the same
N-methylpyrrole carboxyamide ring system. These crescent-shaped
molecules are bound in the center of the minor groove, allowing the
formation of bifurcated hydrogen bonds with the adenine N3 and
thymine O2 atoms on the floor of the minor groove (Geierstanger et
al., 1994).

To target AT-tracts (the principal component of satellites I,
III and SARs), a pyrrole pentamer was synthesized by facile solid
phase chemistry in which five pyrrole (Py) aromatic amino acid
rings are linked covalently by amide bonds (Baird and Dervan,
1996). The resulting compound, termed P7, had the sequence
Py-Py-Py-Py-Py-β-Dp (where β = β-alanine and Dp =
dimethylaminopropylamide). This compound is expected to bind 7
successive A or T base pairs (Ws) according to the n+1 rule, where
n is the number of amides (Youngquist and Dervan, 1985). The DNA
binding properties of P7 were assessed by DNase I footprinting
experiments using a synthetic probe containing isolated, repeated
AT-tracts of 9 Ws (W9, Figure 2A). By visual inspection, the
apparent dissociation constant (Kapp) for P7 was estimated to be
approximately 80 nM (Table 3).
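
The n+1 rule can be applied mechanically to a linear polyamide written as a hyphen-separated string. The Python sketch below is illustrative only. It assumes that every unit preceding the C-terminal Dp contributes exactly one amide bond (to the next unit, or to Dp itself) and that the N-terminus carries no additional amide; the function name predicted_site_size and the use of "b" for β-alanine are hypothetical conventions.

```python
def predicted_site_size(polyamide):
    """Predict the binding-site size (in base pairs) of a linear polyamide
    from the n+1 rule, where n is the number of amide bonds.

    Assumption: the string ends in "Dp" and each preceding unit forms one
    amide bond through its carboxyl group.
    """
    units = [u for u in polyamide.split("-") if u]
    if not units or units[-1] != "Dp":
        raise ValueError("expected a sequence ending in the Dp tail")
    n_amides = len(units) - 1  # one amide per unit before Dp
    return n_amides + 1


# P7 as written above: five pyrroles, one beta-alanine ("b"), then Dp.
print(predicted_site_size("Py-Py-Py-Py-Py-b-Dp"))  # 7
```

Under these assumptions the count reproduces the 7 base pairs stated for P7; it says nothing about affinity, and it does not apply to hairpin or tandem-linked molecules.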

To enlarge binding site size and improve affinity, a pyrrole
hexamer termed P9 was synthesized containing a central β-alanine
(PyPyPy-β-PyPyPy-β-Dp), and it was observed to bind W9 with
100-fold better affinity (Kapp about 0.75 nM) than P7 (Figure 2A).
This latter value was obtained from footprints that extended to
lower ligand concentrations than those shown in Figure (2A).

In an attempt to further increase SAR specificity, a molecule
with even more recognition units was synthesized. The resulting
compound, termed P13, consisted of three pyrrole trimers linked by
β-alanines (PyPyPy-β-PyPyPy-β-PyPyPy-β-Dp). P13 theoretically
requires 13 Ws to accommodate all its recognition units and should
therefore not bind optimally to W9. But unexpectedly, P13 binds W9
with similar affinity as compound P9 (Kapp W9 about 1 nM).
Furthermore, P13 displayed a marked tendency to protect GC base
pairs. This unusually high GC-tolerance is evident from its
footprint on the W9 probe, where protection by P13 rapidly expanded
from W9 into the flanking mixed sequences (Figure 2B). This
expanded protection is already noticeable at concentrations only
two fold above its Kapp (Figure 2B). Quite striking is also the
nearly complete protection (coating) of the W9 probe at higher
ligand concentrations (62.5 nM and above, Figure 2B). The
non-specific behavior of P13 upon binding to short AT-tracts led
the inventors to consider alternative molecular designs to target
long/clustered AT-tracts.



Since satellite I is composed of exclusively A and T bases,
it constitutes an 'ideal' binding substrate for oligopyrroles. But
to obtain SAR-specific compounds, molecules are required that
preferably bind its clustered, irregularly spaced AT-tracts.
Binding studies were carried out with P9, P7 and P13 on the
Drosophila histone SAR probe, which contains the following
clustered/long AT-tracts (W15N3W17N5W16N13W8N1W6, where N is any
base; see also Figure 3D). These studies revealed that P9, P7 and
P13 had similar binding constants for the AT-tracts of the SAR
probe as for W9. The ratio of these two affinities
(KappW9/KappSAR) is used as an empirical measure of SAR
specificity. For all compounds tested thus far, this value
(referred to as SAR preference factor) was around unity (Table 3).
The lack of improved affinity and specificity of P13 suggests that
the phasing and/or curvature correction by the two central
β-alanines separating the pyrrole trimers is not optimal.
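
The two ratios used in this discussion are simple quotients of apparent dissociation constants, where a lower Kapp means tighter binding. The Python sketch below is a minimal illustration; the function names are hypothetical, and the only measured inputs are the P7 and P9 values on W9 quoted above (about 80 nM and about 0.75 nM). The equal constants used to show a factor of unity are placeholders, not data.

```python
def sar_preference_factor(kapp_w9_nm, kapp_sar_nm):
    """KappW9 / KappSAR: values above 1 mean the SAR tracts are bound
    more tightly than the isolated W9 tract."""
    return kapp_w9_nm / kapp_sar_nm


def fold_improvement(kapp_reference_nm, kapp_new_nm):
    """How many fold tighter the new compound binds than the reference."""
    return kapp_reference_nm / kapp_new_nm


# P9 versus P7 on W9, using the approximate constants quoted in the text.
print(round(fold_improvement(80.0, 0.75)))  # 107

# Placeholder values: equal constants on both probes give a factor of 1.
print(sar_preference_factor(1.0, 1.0))      # 1.0
```

The first result is consistent with the roughly 100-fold improvement stated for P9 over P7.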


In order to target SARs more specifically with pyrrole-based
drugs, alternative drug designs were explored, taking advantage of
the hallmark of SARs: clustered/long AT-tracts. Compounds
recognizing up to fifteen Ws (continuous or over two clustered
AT-tracts) have the potential to target SARs well, since AT-tracts
of 15 Ws are rare in random sequence DNA, occurring statistically
only once every [...] for a genome with a 50% AT base composition.
In SARs, however, such long AT-tracts are [...]. The Drosophila
histone SAR probe, for example, used in this study contains 4
AT-tracts of 15 or more Ws.
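
The rarity of 15-W tracts in random sequence can be estimated with a first-order calculation. The Python sketch below is an illustration under stated assumptions, not a figure taken from the specification: bases are treated as independent, the AT fraction is 50%, and the function name expected_spacing_bp is hypothetical. It gives the expected distance between positions at which a run of at least the given number of Ws begins to be read.

```python
def expected_spacing_bp(tract_length, at_fraction=0.5):
    """Expected number of base pairs per window of `tract_length`
    consecutive A/T bases in random DNA with independent bases."""
    probability = at_fraction ** tract_length
    return 1.0 / probability


print(expected_spacing_bp(15))  # 32768.0
print(expected_spacing_bp(8))   # 256.0
```

On this estimate a window of 15 Ws is expected roughly once every 33 kb, against once every 256 bp for a window of 8 Ws, which illustrates why longer recognition sites discriminate better against random sequence. Overlapping windows within a single long tract are counted separately here, so the figure is only an order-of-magnitude guide.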

To target clustered/long AT-tracts, different modes of
tethering oligopyrroles into dimers with a flexible linker were
considered. A suitable linker might allow bidentate binding where
both covalently linked binding domains (hooks) are accommodated by
a long AT-tract or interact with two clustered AT-tracts separated
by only a few nucleotides of mixed sequence. In the latter case,
the linker would serve to reach [...]. Different possibilities
were explored to synthesize oligopyrrole dimers. Satisfactory
results were obtained by building up a hydrophilic, flexible
linker consisting of three 8-amino-3,6-dioxaoctanoic acid units,
termed AO here (Figure 1). Molecular modelling suggested a total
linker length of 60 Å in a fully extended conformation. Two
oligopyrrole dimers were prepared: one by coupling P7 into a
homodimer called Lex9 ([...]) and one by linking P7 and P9 into a
heterodimer called Lex10 ([...]). Their structures are shown in
Figure (1). Lex9 and 10 are expected to bind 14 and 16 Ws,
respectively. As discussed, such a binding site could either be a
long, continuous AT-tract or a bipartite site consisting of two
clustered AT-tracts. These alternative sites are referred to
inclusively with the term long/clustered AT-tracts.

The relative binding affinities of dimers Lex9 and 10 for
clustered/long or short/isolated AT-tracts were compared by
DNase I footprinting. The results are listed in Table (3). Several
remarkable conclusions can be drawn from these footprinting data.
While Lex10 protected the SAR regions at



« 0.28 nM, Figure 3A. Ta^le 3), a 

3^„an.^olar -=>«--*^^^^^°;'J^^^,^, .ecjuired to titrate the 

.eoX..ed W. .r.ct ^ ,0. Note .ha. 

preferexxca factor (K.^W9/ ^pp ^^^^^ ^„ the 
I^exlO also discriminates ag^xnat bxad^ng 

SAR pro^e. since this site is also poorly protected (P.gu 

. LexlO For Lex9 a SWl-pref erence factor of 

In contrast to LexlO. ro ^^Hitional pyrrole and 

™-=,sured (Table 3i - Hence, the additional pyr 

onlv 2 was measurea lAc^-i*, •koM^ 

; "Line u=..s .>«t 

in^iroved SAR-specificlty ai«l atflmty. 

r»e =££e=t o£ linker Ungch. » third heterodimer 
exa^ne ^^^^^ ,^..,eel. . ^ie 

L oll.opv„=l. ao».l». (hoo.., a. 
compoxind contaxna tne /piaure D • Intereetingly , 

i-!„>«H bv only one AO unit tFXgure ±i 
LexlO but IS linked toy omy 

LexlO, xt diScrinixuaLw SAR) of 100 waa 

i„pr»ved SAR-Bpeolticity faccor (K^ -W. 
measured for thlB compoand (Table 31. 

, .1, aimers in stark contrast to P13, displayed 
I„^orta„t y all d^..- i ^^^^ 

Hi.V. ,ro.e. « ™»tio„ed a^e, P13, 

...eir «ootprx»t pattern, on P Increasing Ueand 

upon prote=t.on of raP^Xy J 

concentration rnto 10 .also 

^t of the probe (Figure 2B) ^ 

.e,ie. not s.o»., .ardly -^^^ „ratio„s a.o,e 

sequences and no coating is observea 
tboae Bhovm (Figure 2C).. 

xn ,u™ary, di<.ers I-xlo and X.X1B. as '° 
^„c:ers (P., W and PU. ".hly and »-sp,c...e. S» 
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. ..icant increase in affiixity 
3peci.ic..V is .o. ac..evea ^^^^^ ^.^^.o. a.ains. 
tnese ele.en.a .u. p..- ^ J^^^ ^^^^^^ ^^^^ 

to bind with High aff xnxty to 



■ « moiety EDTA-Fe(II) to the 
..taC^nent t.e OKA ^^^^^In J-n of .indin. location, 
.er^inu. o. .he.e .i..r. --- ^^X;;^ .xeav..e Products 

o.i«nt.tion an. . X.a., - to car^ out t.e.e 

on high resolution gels (Tay (pyPyPy-p-fyPYPY"^- 
e^eri«.nt3. a .e(XX>-K.TA .na^e ^^^^^ .....t. 

PVPyPy^y-y-P-^--^^' i;,,,..ed in Pisure (3B) together 

X-exlBE on the SAR probe ,^,,,^ge products 

«... . c reaction. the .orde. of 

.eveax. that ^-g^ ^ .,_,e site, in «XS 

xa..e c^te^inue o. .exXB) are centered 

Cindicat.ng the posxtio ^^^^ ^^^^^^^^ ^ 

arovind n.»cleotide 609 (beXow . .vacketa pointing 

a indicated by an arrowhead on the brackets, p 
orientatxon as indicatea oy ,pic^re 3B) . For WX5 

towards the K- terminal- side of the . molecule (Fxgure 3B) 
air the diatributio. of cleavage product, suggests an 

. ..^^o« These results (suiratiarized in Figure 
opposite dimer orientation, TbeBe resu 

3D) indicate that only a single Lex18E molecule is predominantly
bound at W15, W16 and W17 (1:1 drug to DNA complex). Drug
orientation must depend on the size and sequence context of a
particular tract. The data do not establish whether the individual
hooks of the dimer can span across a mixed sequence spacer, since
on this SAR probe AT-tracts are long enough to accommodate both
pyrrole hooks.
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«lnitv cleavage data suggest: thac Le^lB. and by 

^eae a.f.nity \ ,,3hion wirV. both 

inference the other ditaers. bin ^^^^^ 

V. ,v,H AS schematized in Figure (3E) . m tnx 

botH hooks ^^^^^ „ „, cnly one hook ca. b. 

,c=oap»dated propexly. The ''■=™ ^^^^^ i„.pooticn of 

r. « " - " „..b .... 

t bith clc^n^lion. X.x^ a.a X^.O. no™. 
.n«rprct.t.on^ " , relatively AT-.ich region labelea «o 

„eaR protection °^ "^^ ; ^taction 1^ P"P=-'» *™ 
" 'irr boo. th,t ,e.ohea a=.03, .eve..! 

interaction of the .^gopyrrole »in»ra c«> hind in an 

rtrrtrtr^rrti: — - — 

can bridge several base pairs . 

^ £ootpri»ti,« data presented above demonstrated that 

.assess considerable SSR- and M-speci£ioitY 
din^ric <=»3<^"°'=; X„ ,^,3 specificity also apply to 

.ben probed on na.ed /"^ ' ^ ,ni, ^e.tion, 

T -°3tiryr™ntrr;:e pyrrde H..™!. in order 

r Tain islted Kc nnclei and polytene cbro^»o»eB .or 

■ „ bv epifluorescence microscopy waa explored. If 

,„„net.o„ b^ epi ^^^^^^ ^ ^^^^ „ 

ae^iuence preference is u.ahliciht in stained nuclei 

Chromatin, it Should be ..crescent 

- position. o. th. -7":„, ^.d 
oligopyrroles (satellites 




demonstrated. 

rs. were coupled to ' moxxomexic and ditneric 
Fluorescent groups were co P ^^ccinimidyl active 

using conrrcvercially avaxlabl-e succinxmxay 
oligopyrroles uBxng footprinting of tlxe fluoreecent; 

eatera of fluorescein. DNase 1 too P differently affected 

that these derivativee are dxfferencxy 
ligand3 revealed that th ,^ ^^^^^^^ ^^^^.^^ 

affinity but never at ^.fidtv factor was observed 

_e compound. .» -^^.^ ,.e.lo«, only a 

Table 3). For . £luore.cexn ^.^ifloity 

,^out =0 to 100 fcld^ £lu„«=c.»t label to 

,Table 3) . preference (over W,, Ero. 2 to a 

^ r>c; Tlie SAR specificity oj: r^*^ 
factor of 25. The S P ^^^^^^ ^^^^ 

fold. The fluorescent moiety or tux 
di scriitlinat ion . 

1 . were double stained with ethidium 
Oroeophila Kc ^^^^ J^^^^^^^ .,,„,_,3 (Fi^- 4) . To 

" and --7;;;-;^^^:^ etainin. pattern, the 

allow comparison of the dxmer ver recorded in 

l«.ee b, n.ore.o,»ce ,rU..e .re.> 

parallel iC^T el.o .»r.eaiy outli.ee 

3talne nuclear chromatin generally b,.t It ,^„„,iear 
^he -ucleolue due tb th. high RHA concentration of 



dotnain- 



The staining patterns observed with P9F and Lex9F (green)
show striking features; both ligands accumulate at one or two
subnuclear locations (Figure 4 A and B), resulting in strong green
foci. These foci are generally abutting the nucleolus and are
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: ™.ch ^raa, the nucX.«pl- 

ot.e«e6 t»e monomer ^.^^ s.nc. 

.„ined with .PP«" „e .esi.uel nucleopXaemc 

..alnin, ih«n,icy of ^,,.1, that confim, the 

obtained witb. the 
■ ^ ..e ,.„...e./eh»t 

„a=.. that "^^^^Xtio» o^ex,. i. then a conae^»« 

r/:„ lo.er p..£e.eu=e f» the.e tracte. 

P„lVt»e chro»*o«..= £l„o«.cent 

r :r:x...tica ^. o^aop^na po^V^ne 

possxbly side-by-side arrays of several hundred 

chromosomes consist of sxae oy b ^ „>,^^„„ar,mes 

chromatin strands. The arms of these polytenized chromosomes
consist predominantly of the euchromatic, single-copy portion of
the genome. They are tethered at the chromocenter, which contains
the centric heterochromatin. While the euchromatic arms are
polytenized about 1000 fold, the centric repeats of the
chromocenter are known to be under-replicated (Miklos and Cotsell,
1990).
Figure (4D) shows in red (ethidium bromide, EB) the
euchromatic arms and the central chromocenter of a single spread
polytene chromosome. The band/interband substructure of the
euchromatic arms is easily observed. This banding pattern is
proposed to arise from a differential degree of DNA [illegible]
along the arms (Rykowski et al., 1988; Spierer and Spierer, 1984).

Lex9F staining (green) is superimposed over the red EB signal
(total DNA). The latter pattern displays conspicuously two major
signals, which abut the chromocenter. They localize to the base
of chromosomes 4 and 3R, corresponding to the location of satellite
I as was determined by conventional in situ hybridization (Lohe et
al., 1993). Satellite I is composed of short AATAT repeats and is
therefore an ideal target for Lex9F. Besides the two strong
signals described, other prominent Lex9F signals (Figure 4D) were
reproducibly observed. Among those signals, one is within the
chromocenter (arrowhead) and may represent the AT-rich satellite
III consisting of a 359-bp repeat. In mitotic chromosomes, this
satellite encompasses almost half of the X heterochromatin but is
highly under-replicated in polytene chromosomes. Furthermore, a
major band rich in AT-tracts can be noted on the arm of chromosome
4 (arrowhead). These observations demonstrate that Lex9F
selectively stains satellite I and likely also satellite III.

It is demonstrated below that it is possible to visualize by
Lex9F/10F staining genomic regions along the euchromatic arms that
are rich in clustered AT-tracts supposedly representing SARs. This
is particularly evident when micrographs are collected without the
prominent satellite signals, which tend to visually suppress the
more subtle variations of red and green along the euchromatic arm.
Figure (4E) shows the band/interband structure of the polytene
chromosome in red and in green/yellow the impressive staining
pattern of Lex9F observed as transverse stripes of variable
thickness. At some locations, an entire band is highlighted, at
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- =. <-H±n line at band borders or at 
...er ..tes ^^^^^ ^X^^^^Z^Z^ ..-r.cb si^.ls .ear 
interbax^ds regions. Of xnterest a ^^^^^^^ ^^^^ 

telotneric ends of cbromoao^e X. 2L . 

T. T-eatrictive staining of Liex»t 

to the much more restrict v . _ thereby facilitated 

.„^...d to EB, chromosome mapping thereby 



cotnpared to 
considerably 



dt-ained nuclei and polyterie 
rHeee epif luoreacent etudiea of stained nuc 

. strongly support the notion tl.at proper enlargement of 
chromosomes strongly supp ^^rrole based DNA 

•^^ through dimerizatxon of pyrroie 

:::::: l ^ ... ^...t. .r 

maintained when DNA is paclcaged into chrotnatin. 



X„ the framewo^. of a search for molecular tools to study 

■A targets the abundant satellite V composed of 

PEV. polyamide that targets tne ^ „ - 

oL^. Jpeats (.Che et al. was synthesised- .esxgning 

"t^Xee that would bind to this repeat .otif represented a 
Xllenee since with current ^owledge. tar.etin. of sec^encea 
containLg s' -O..-.' or with drugs co^osed of pyrrole 

and imidazole is difficult. However, successful targeting to
sequences containing 5'-GTG-3' was previously achieved using an
Im-β-Im motif where β-alanine replaces the function of pyrrole
(Turner et al., 1998). Since β-alanine, like pyrroles, is
degenerate for A·T and T·A base pairs, we designed a compound
based on these observations, to recognize a sequence composed of
two tandem GAGAA repeats by systematic placement of β-alanine at
the N-terminal neighbor of imidazole. The binding affinity and
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CP), were evaluated by DNAse I P ^OAA repeats- 

..^.e (S.) snows ^-^J^l^^^^ ^ ..peats <Xane. - 

target binding aite, xn ^^^^ sequence was 

^v,«- ViiTidin<? constant or vjsj. ^>-' 

The apparent bmain^j ^„fr-ationB protection of 

^ ^ ft 95 nM At higher concentratxcne, p 



two rai 

coaMi=s an AMTG motH IfiS"''' =*' 



eetereine .lading o..ent..io„ atoio..o™e.r. or P3X, 

''^T. „ ^ 1> «»»"V .Xe,v,.. w.» carried out on 
Py-^»-|i-l»-P-l=P-^l"'^' . revealed 
.ootprm. pro.. co„...n.n9 <»^J P ^^^^ ^^^^^ 

confirming tHe assuttiptxon that one 
repeats in a 1:1 drug to DNA complex. 

^. J- ^r^>^^ ae Opposed to conventional 
. ar»...oK o^ thl, bx.d.« ^.1. =PP ^^^^ 

.i,e„e«ce «c .„d CO -^^'^/^^^.U „ SW.~, -ner. . 

The consenaus sequence can thus be detxnea 

stands for a G or C and W for A or T. To evaluate binding of P31
to CACAA repeats, we used a second probe that contains two of
these repeats as well as five tandem GAGAA repeats. Figure (5A)
shows that P31 protects CACAA repeats with approximately five fold
lower affinity than GAGAA repeats (lanes 11-15). Furthermore,
affinity cleavage reactions using P31E revealed two major cleavage
sites in the GAGAA region (lane 16), showing that in this case,
two P31 molecules are bound in tandem to the pentameric GAGAA
repeat. Again, it is observed that this molecule binds as a 1:1
drug to DNA complex in an orientation as indicated by arrowheads
(Figure 5A). We propose that special structural features of
AT-tracts and GAGAA repeats might favor 1:1 DNA to drug complexes
(see Discussion).

w *->i^t P31 fed to developing Drosophila 

It was observed tnat ^^j- 

™l„o^„t« of the -.ro^^-eo^inaa. ^no.vP^ 'T'"'llIIIt w« 
function Of the »=A fa=t« 10... . ^ footprint •^«™« 
the^fox, carried OMt with thie protein. The cm prohe (GA.3i. 
used for thia purpose contain, heeide» the ,«a«=,. «.trf th. 
„^et Of P3i) a typicl promoter proxi^i brnern^ eit. 

derived f^. the Uhx ,e„e ,Bi,ein et .... ' J^" 

contains the penfmeric coneensus e-^ence s«=«5 of OAF 
V,- ,t al 1997) . The DSa.e I footprint studies show 

that, while GAF binds both the (AAGAG)2 and Ubx motifs, P31
interacts only with the former satellite repeats (compare panels A
and B of Figure 5).

Selective staining of GAGAA Satellite V in Nuclei and Polytene
Chromosomes

We synthesized fluorescent derivatives of P31 to visually
assess their binding targets by staining of nuclei and
chromosomes. DNase I footprinting of the fluorescent ligands
revealed that P31T bound the GAGAA sequence with unaltered
specificity but with 100 fold reduced binding affinity.

Drosophila Kc nuclei were triple stained with DAPI, Lex9F and P31T
and recorded by epifluorescent microscopy. The micrographs
obtained again are striking since one notes, against the blue
background of nuclear DNA, separate green and red foci stemming
from Lex9F and P31T staining, respectively (Figure 6A). Closer
inspection reveals that these foci are largely non-overlapping
(compare panels A and B).

In situ hybridization analysis showed that it is possible to
detect satellite I but not satellite V ((GAGAA)n) in polytene
chromosomes obtained from wild type flies, supposedly due to
more severe under-replication of satellite V (Platero et al.,
1998). Hence, due to this apparent absence of GAGAA repeats, the
specificity of P31T for its target binding site cannot be
evaluated using 'normal' polytene chromosomes. Therefore, to
circumvent this limitation, we prepared polytene chromosomes from
brown-dominant (bwD) flies, which harbor a large block of
heterochromatin (about 1.7 megabases, made of GAGAA repeats)
inserted into the coding region of the brown (bw+) gene. This
heterochromatic insert appears to be normally polytenized (Csink
and Henikoff, 1996; Dernburg et al., 1996; Platero et al., 1998),
probably due to its euchromatic localization. Polytene chromosomes
were prepared from these flies and stained with P9F, P31T and
DAPI. The results obtained were striking (Figure 6). P31T (red)
highlighted conspicuously the bwD GAGAA insert at locus 59E on the
right arm of chromosome 2 (2R). No other P31T foci were observed,
neither at the chromocenter nor along the euchromatic arms. Lex9F
(green) marks the position of satellite I at the base of
chromosome 4 and 3R, abutting the chromocenter as shown above
(Figure 6). The familiar band/interband pattern of polytene
chromosomes is revealed in blue by DAPI staining.

In summary, different satellite-specific polyamides were
synthesized as established by footprinting and epifluorescence
microscopy. Oligopyrrole dimers (and their monomers) bind
satellite I, III and SARs. Enhanced SAR-specificity was obtained
by tethering oligopyrrole moieties with a flexible linker. The
Im-Py compound P31 was shown to specifically bind satellite V. All
these compounds bind their DNA targets as 1:1 drug to DNA
complexes.
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rov-Pv-Pv) causes opening of 
the chro™ti= fiber, ,tB y .^^^ „d L.e»nli, 

, .,mthecio polyamide. Have BimllM: effect, on 

^. ».ntioned above, satellite I . Biochemically, 

,ach rape., unit i. pac^^ed .n t^ ^„,^„eiaUy hind 

.atellite III repeats behave " et 
nuclear scaffold., topo ,,,poiBomerase II also 

al., 1996, demonatrated by 

•^v.-^ at satellite III ^.n vi^u, 

e„r.ch.a £i„oreacent topoiB=«er,a.e II into Dro.oph.la 

^,roinje=t.oh of £luor« ^^^^^^^^ 

embryos (Marehall et al., 

promlnen. topoi.o^raae c eava^ - ^^^^^^ 

.eo»d ^ acLmulate in the presence of 

,^o.»o™raae XIJl- 3^ Kc nuclei are e^o.cd to Xenop^ e^ 
the cytoatatic drug W treatment generates a DHA 

..tracte. ri=hintopOi,o™raae II. T ^^^^^^ ^ 

,.^r With a r^e« ^ 

lef" ~in,ly, cleavage is ^asively stimulated 
(Figiire 7A, lett) - shown). Cleavage 

.^^^♦-■ioTi of the monomer P9 taiso 

addxt.on of ^ increased intensity of the maxn 

repeat band (marked M. one cut per 359 bp P 

. shorter fragmentB. Stimulation xa Tuaximal at 500 
the ladder to "J'^f ^centratio.s (.iguxe 7A) . 

^"'\\rrerirs in the appearance of additional, minor hands 
exposure also ^^^^^ within nucleosomee 

(Marked m) that most IxKely a ^. ^^^^^ 

(aee diBCuaexon) . These mxnor bands are. not 
drug, even after extended exposure (data not shown) . 
Here, the potency of P31 was tested. The results shown in Figure
(7A) demonstrate that P31 stimulates considerably less well than
P9. That is, while massive cleavage stimulation is observed with
the lowest concentration of P9 (62 nM, Figure 7A, lane 3), no
significant reinforcement of the pattern is observed with P31 up
to a concentration of 200 nM (Figure 7A, lanes 8 to 11). Only at
600 nM is cleavage stimulation by P31 comparable to that obtained
with 62 nM of P9 (compare lane 3 to lane 12). Stimulation with P9
is maximal at 500-1000 nM and starts to diminish at higher
concentrations. The cleavage ladder induced by P31 at these
concentrations is also less pronounced than that of P9, in keeping
with the dose response observed. These dosage experiments
demonstrate that P9 opens the heterochromatic satellite III at a
roughly 10 fold lower concentration than P31.
Figure (7B) shows a similar experiment with the dimer Lex10.
Interestingly, it was observed that Lex10 does not stimulate
topoisomerase II cleavage and that inhibition occurs abruptly
around 600 nM (Figure 7B).

The data presented above demonstrate that the synthetic
oligopyrrole compounds P9 and P7 (not shown) strongly facilitate
cleavage by topoisomerase II. The dual response (stimulation or
inhibition of enzyme activity) to drug treatment is thought to
reflect the initial opening of chromatin, facilitating cleavage,
whereas inhibition of cleavage at higher concentration is proposed
to arise from blocking of the actual cleavage sequence by these
minor groove binding drugs. An important control experiment was
carried out to establish that cleavage stimulation by P9 occurs
through chromatin opening and not by directly affecting the
overall enzymatic activity of topoisomerase II. Double-stranded
topoisomerase II cleavage during exposure of cells or nuclei to
VM26 mediates the accumulation of genomic fragments that can be
observed by the appearance of a 50 to 100 kb DNA smear using
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T€ inhibition of topoiaomeraae II 
p..ee-fieXd electrop.oxes.^^ x. ^^^^^^^^^ ^^^^ ^^^^ 

tne intena.ty of the em ^^^^^ duplicate the 

... - ---- r::ioo - -a. .ono^i.. .^^^^^^ 

.ppearanc. of the ^^^^ ^^^^^^ ^^^^ „,,^,ed 

2r W obaerved that the .0 to lOO ^ s^ear .n 

(laue3 1 2 we o ^^^^ ^^^^ ^^^^^^ ^ 

..e presence o. e. (1^^^^^^ .lthou.h .exlO at the 

conceutratxo. ^ , does not interfere 

completely in satellite 

with the genome-wide cleavage. 

^ditional Observation that^ supports the notion of 
. ^ ; ^^3^ facilitated cleavage withxn 

chromatin opening xs that satellite XI I repeats 

n-,'-^ TTT bv restriction enzymes- 
satellite III by re cleavage site a Haelll 

v,==.^ the topoiaomerase II cieav.iyc 
contain near the cop ,„„«lv been demonatrated that 

,«.r,rH* It was previously oeeu i*^'" 
restriction sequence. It was p facilitated toy 

w Haelll in chromatin Cnot dnaj 
cuttxng by Haem aimilar observation 

, -r ^^.,...-1 ; 19 925 We made a aimj.x«*-i- «-.*^ 
distamycin (Kas and I^aemmli, 1992). 

using P9 {data not »hown) - 

Mitotic Xenopus egg extracts convert added nuclei and sperm
to chromatids in vitro. This chromosome condensation process
requires topoisomerase II (Adachi et al., 1991), the protein
complex condensin (Hirano T., 1997) and presumably other
unidentified activities present in the mitotic extract. First,
chromatin is remodeled and nuclei then proceed quite synchronously
through a number of morphologically distinguishable steps (Hirano
and Mitchison, 1991). Remodeling is morphologically manifested by
swelling of the nuclei, which involves exchange of basic
sperm-specific proteins for H2A/H2B and the incorporation of
histone B4 (Dimitrov S, 1994).

Pyrrole drugs were added to the extract together with the
sperm or after the remodeling step (at 10 minutes) and the extent
of condensation was determined after 120 min. At this time point,
the conversion of all sperm nuclei to clusters of individual
chromatids is complete in the absence of drug (Figure 8A,
control). Lex10 was found to be a potent inhibitor of chromosome
condensation. Addition of this compound at 125 to 250 nM
(indicated) arrested this process at the so-called early 'ruffle'
stage (Figure 8A). These structures retain the swollen sperm
shape, but they have peripheral blebs (ruffles) and a slightly
heterogeneous interior. At this drug concentration, no chromatids
are seen. If the concentration of Lex10 is raised to 500 nM, we
observed an even earlier arrest as evidenced by the accumulation
of swollen, remodeled sperm-shaped nuclei containing a homogeneous
interior and smooth periphery. Lex9, less SAR specific than Lex10
according to the footprinting data, was found to be a less potent
inhibitor of condensation since it requires 4 to 8 fold higher
concentration (1 to 2 µM) to achieve a block at the ruffle stage
(Figure 8A). Little inhibition was observed with Lex9 at a lower
dose of 250 to 500 nM. The monomer P7 was also tested but we
observed no inhibition with the pyrrole pentamer up to the highest
concentration (8 µM) tested. Condensation was inhibited at a
ruffle stage with a P9 concentration of 2 µM (not shown).

Is inhibition of condensation by pyrrole compounds a specific
process? The fact that the concentration of a given drug required
for complete arrest of condensation is related to the SAR
preference factors suggests that the inhibition is specific. To
address the question of specificity directly, competition
experiments were performed. Preliminary competition assays showed
that chromosome assembly in egg extracts is relatively insensitive
to added oligonucleotides (about 50 bp in length). Up to 500 ng of
cli.onucleotides can he .dded to the — \ J^^: 
nuclei (about 75 ng DKA) without interfering wxth -^^^ ^^^J/ 
therefore argued that, if inhibition by oligopyrroles occurs
through binding to clustered AT-tracts, addition of an
oligonucleotide containing clustered and not single AT-tracts should
prevent the arrest.

In the experiment shown in Figure 8B, Lex10 was added to the
extract at a final concentration of 1 µM (several fold above the
minimum inhibitory concentration), after which competitor
oligonucleotides were added at different stages. Three different
oligonucleotides of similar size were used: the oligo contains
tTlarge clustered AT-tracte of the probe (WX7KS«1S) . the 

ougo has a Single AT tract of s base pairs and the ol.go 
harbors S tandem GAGAA repeats. 

.i^re ISB, .ho.a that conde».ation inhihition ^ ^^'J-^ 
completely reveraed hy the addition o£ 50 to 100 of the 
:r.o Wharea. up to , ti^a thia .»ou„t .,.0 n., of — « 
or Cligo did not reverse the hlock. Th.s support, the 

..auction that .exlO interfere. »itU chro„oa«e ^^'^ 
selective titration of long, clustered (not isolated) AT tract. 
::" that inhibition doe. not occur through general 0» ..ndrn,. 
It. contraata with the oh.ervation «ad, »ith the "--' J; 
Which blocad condensation at . Addition of SOO ng of e^har 
17 the SAE- or GASAA-oligo did not reacue chromatrd ....»bly. 

r interfere. ,ith condensation in a ....ence independent 

maitner . 
Biochemical analysis of the arrested sperm-derived structure
demonstrated that it contained a normal protein composition
concerning topoisomerase II and the components of condensin (not
shown).

In conclusion, the data demonstrate that the dimer Lex10
specifically interferes with chromosome condensation through
interaction with clustered, long AT-tracts. It further highlights
the experimental potential of pyrrole-imidazole based drugs as
powerful tools for chromosome research and cell biology.



The use of a longer 8-amino-3,6-dioxaoctanoic acid linker
(referred to as Ao), bridging 2-3 base pairs per Ao unit, proved
to be an excellent way of joining DNA binding elements without
impairing sequence preference of the individual units. For this
binding study, three compounds were synthesized with one, two and
three DNA binding elements (an N-methylpyrrole carboxamide
tetramer) that were covalently linked by the longer amphipathic
linker mentioned above (Ao). These pyrrole-based compounds are
degenerate for A and T. The trimeric compound P49 (see Figure 1)
showed very little preference for these sequences. The dimeric
compound P50 displays intermediate properties. This is illustrated
in Figure 10. The methods of synthesis are the same as those
described in Examples 1 to 7.



Example 9: Tandem linked hairpin molecules

A hairpin shaped molecule designed to target 5'-GGTTA-3' will
have only moderate binding affinity and sequence specificity.
Targeting a longer sequence such as 5'-GGTTAGGTTA-3' with two
tandem linked hairpins (the DNA binding element) greatly increases
binding affinity and sequence specificity. As above, optimal
results were obtained by use of an Ao linker. The structure of this
tandem hairpin molecule (termed P52) is shown in Figure 11. The
excellent sequence specificity of P52 for 5'-GGTTAGGTTA-3' is shown
in a DNase I footprinting experiment in Figure 12. In this Figure,
it can be observed that at concentrations far above the
concentration required for protection (~5 nM), no additional site
becomes protected, even at the highest concentration tested
(500 nM). The methods of synthesis are the same as those described
in Examples 1 to 7.

Using this approach, the relatively low sequence specificity of
pyrrole-imidazole compounds can be overcome and compounds with
enough affinity and specificity for biological applications can be
obtained.
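
The benefit of a longer target site can be illustrated by counting
how often a site of a given length is expected to occur by chance.
The following is a minimal sketch only: it assumes a hypothetical
random sequence with equal base frequencies (real genomes deviate
from this), and the function name and the sequence length are
chosen for the illustration and do not come from the Examples.

```python
def expected_chance_sites(site_length: int, sequence_length: int) -> float:
    """Expected number of exact matches to one fixed site on one strand.

    Assumption (not from the text): bases are independent and A, C, G
    and T are equally frequent.
    """
    if site_length <= 0 or sequence_length < site_length:
        return 0.0
    positions = sequence_length - site_length + 1
    return positions / 4 ** site_length


# Hypothetical sequence length in base pairs, chosen only for illustration.
genome_bp = 100_000_000

short_site = expected_chance_sites(len("GGTTA"), genome_bp)
long_site = expected_chance_sites(len("GGTTAGGTTA"), genome_bp)

print(f"5 bp site:  about {short_site:,.0f} chance matches")
print(f"10 bp site: about {long_site:,.0f} chance matches")
print(f"ratio: about {short_site / long_site:,.0f} fold")
```

Under these assumptions each additional base pair of recognition
lowers the number of chance matches four fold, so a 10 bp site is
expected about 4^5 = 1024 times less often than a 5 bp site.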

An important property for linked polyamides is adequate
solubility in aqueous solution, such as tissue culture media.
Tethering polyamides with an amphipathic linker of the
invention, in contrast to a hydrophobic linker, can confer
enhanced solubility to the DNA-binding molecule.

By way of example, two tandem hairpin polyamides ("P52" as
described in Example 9), recognizing two insect-type telomere
repeats (TTAGGTTAGG), were synthesized and equipped with a
hydrophobic alkylating group (chlorambucil) as "effector"
moiety. One compound contains a hydrophobic methylene linker
(5-aminovaleric acid) and the other an amphipathic linker of
the invention (8-amino-3,6-dioxaoctanoic acid or "Ao" for
short). The structures are shown in Figure 13.

In tissue culture experiments, designed to measure the
cytotoxicity of the above compounds, it was observed that
P52CHL-Val, in contrast to P52CHL-Ao, precipitated; this is
manifested by the formation of crystals adhering to cells and to
the bottom of the culture dish.

To quantify the enhanced solubility, both compounds were
dissolved in cell culture medium supplemented with serum (RPMI
medium with 5% NCS, 200 µl final volume) at a concentration of
[value illegible] (by dilution from a 1 mM stock in DMSO). After an
incubation period of 4 hours at 25°C, the solutions were spun at
4°C (16,000 g, 8 min) and the supernatants transferred to new
tubes. The insoluble pellet was taken up with 100 µl acetonitrile
(90% in water). The fractions of precipitated compound (in the
pellet) and soluble compound (in the supernatant) were determined
by HPLC integration. The results are shown in Table 4 below and
Figure 14. The results demonstrate that solubility is
approximately 5 fold higher for the compound with the
amphipathic linker of the invention.



Linker                            Percentage in    Percentage in
                                  pellet           supernatant

8-amino-3,6-dioxaoctanoic acid    46               54
5-aminovaleric acid               89               11

Table 4. Soluble and insoluble fraction of differently linked
tandem hairpin polyamides.
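
The approximately 5 fold figure can be checked from the percentages
of Table 4. The sketch below is illustrative only; the dictionary
layout and function names are assumptions made for the example, and
only the four percentages are taken from the table.

```python
# Percentages read from Table 4; the data structure itself is hypothetical.
table_4 = {
    "8-amino-3,6-dioxaoctanoic acid": {"pellet": 46.0, "supernatant": 54.0},
    "5-aminovaleric acid": {"pellet": 89.0, "supernatant": 11.0},
}


def soluble_fraction(row: dict) -> float:
    """Fraction of compound recovered in the supernatant."""
    total = row["pellet"] + row["supernatant"]
    return row["supernatant"] / total


def fold_solubility(table: dict, test: str, reference: str) -> float:
    """Ratio of the soluble fractions of two differently linked compounds."""
    return soluble_fraction(table[test]) / soluble_fraction(table[reference])


fold = fold_solubility(
    table_4, "8-amino-3,6-dioxaoctanoic acid", "5-aminovaleric acid"
)
print(f"soluble fraction ratio: {fold:.1f} fold")
```

The ratio is 54/11, i.e. about 4.9, in line with the approximately
5 fold enhancement stated above.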




Example 11: Materials and Methods

The following indicates the materials and methods used
throughout the Examples.

Boc-β-PAM-resin, HBTU, Fmoc-Glu(OtBu)-OH, Boc-β-alanine and
Boc-γ-aminobutyric acid were purchased from Novabiochem AG,
Switzerland. HOBt was from Bachem. The methyl ester of
4-amino-1-methylpyrrole-2-carboxylic acid hydrochloride was
synthesized by Bachem on special request. DMF, acetonitrile (HPLC
grade) and 3,3'-diamino-N-methyldipropylamine were purchased from
Aldrich. N,N-diisopropylethylamine (DIEA) was from Sigma and
Fmoc-8-amino-3,6-dioxaoctanoic acid was purchased from Neosystem,
France. Dichloromethane (DCM), thiophenol (PhSH), ethanedithiol
(EDT), trifluoroacetic acid (TFA), thiodiglycol, piperidine,
N,N'-diisopropylcarbodiimide (DIC), dicyclohexylcarbodiimide (DCC)
and 3-dimethylamino-1-propylamine were from Fluka. FLUOS
(5(6)-carboxyfluorescein-N-hydroxysuccinimide ester) was purchased
from Boehringer-Mannheim. All reagents were used without further
purification. Glass peptide synthesis reaction vessels (5 ml) with
a #2 sintered glass filter frit were obtained from Verrerie
Carouge (Geneva, Switzerland). Analytical and semi-preparatory
HPLC was performed as previously described (Baird and Dervan,
1996). Electrospray ionization mass spectra were obtained in the
positive ion mode on a Trio 2000 instrument at the University
Medical Center (Geneva, Switzerland).

Synthesis of pyrrole monomer for solid phase synthesis.

1,2,3-Benzotriazol-1-yl 4-[(tert-butoxycarbonyl)amino]-1-
methylpyrrole-2-carboxylate or BocPy-OBt was synthesized from
4-amino-1-methylpyrrole-2-carboxylic acid methyl ester
hydrochloride (Baird and Dervan, 1996).

«an<»l solid ph... .y~th..i. of pyrrole <.o.*.o,ma. • 

Couplimo Of Boc-Pyxrol= .«r. perfor,^d « prcv.o„.ly 
ae.=rib.a CB»ird .nd Dervan, 1996) . Boc d.protecclon. wore oarrx.d 

:l .cld» c»a.itu.ln. ..o X.„.or ^ =7;-;^^; 

pre-activotion with 1.1 0<^ival=nt= o£ HOBt and Die for 5 m». TI.0 
: i„ ai« a=.iv. e«ar» were added CO - 

.„utrall«d reei» in 4 fold oxo.e. »d allowed to react for 1 x Ih 
Td 30 in preaeno, of = «uiv.len.a DIK. ... te^orar, 
protecting wa, removed with piper.dme .n PCM Ux 

1. 10 »i„.. The reain w„ then washed with D« »^ «^ 

r^iiit-^^mic acid was acetylated I2x ii» 
f-i^irl The N -amino group of gXutaraic acxa 

Ti^ 'with aoetic anhydride U = =:l n„P/Ac.c/DlE.. - ^"^ -^^''^ 
oroLotino group of glutamic aoid w.e re»=ved aa descrxhed ahov. 
prot.ct.ng g P „j,l, 3-dimetbyla.ano-l- 

for Boc groups. Cleavage trom 

p.opyl.nd»e or 3 ,3 • -dia«lno-N-»ethyldipropyl.ndne waa performed as 

LLibed (Baird and Dervan, 1..0 . After cleavage, 

„ce.a organic baae wa. removed prior to HPI.C pur.f.oat.on fy 

.ecipitation Of pyrrole ..idea ^^^^^^^Z 

mixture wa« mixed with 3 4 voiuni , 

aa^tion of 10 volumea of cold .-^o-c, petrolew. 

precipitated product ..a collected by c=„tri£ug.tlon and draBolved 

in 1% TFA to obtain acidic PH- 

Dimerization of oligopeptides.
First all purified oligopeptides (with a un.que reaot-e 
carboxyl or ..ine, were loaded an additional titne on a P-P— 
HP.C colu^ and washed exteneively with .0-30 coXu^. volume, of 



TFA-free buffer A (5 mM HCl in water) to eliminate traces of
remaining cleavage reagent and TFA that would otherwise terminate
the dimerization reaction. The compounds were eluted with buffer B
(2 mM HCl, 90% acetonitrile), collected, lyophilized and
dissolved in DMF at a concentration of 20-50 mM. The
concentrations of pyrrole pentamers were determined
spectrophotometrically assuming an extinction coefficient of 46000
M-1 at 312 nm (Martello et al., 1989). Concentrations of compounds
containing (Py)3-β-(Py)3 were determined spectrophotometrically
assuming an extinction coefficient of 68000 M-1 at 302 nm. For
activation of the oligopeptide containing the unique carboxyl
(N-terminal glutamic acid), 300 to 500 nmoles were mixed with 4
equivalents of HOBt (1 M in DMF) and 4 equivalents of DIC (3 M in
DMF) and incubated at room temperature for 15 min. Next, DIEA was
added to obtain an apparent pH of approximately 10 (between 0.4
and 0.8 µl) and the oligopeptide containing the unique primary
amine was added (same equimolar amount as other oligopeptide). The
mixture was incubated at 37°C in a shaker at 1000 RPM. Aliquots
were taken (~0.1 µl) to follow the formation of dimer (by
RP-HPLC). The reaction time for > 95% completion varied between
several hours and overnight. When the reaction was complete, the
dimeric oligopeptide was purified (by RP-HPLC) and dried in vacuo.
Dimeric oligopeptides were dissolved in DMF containing 0.1% (v/v)
thiodiglycol at a concentration of 1.00 mM and stored at -70°C.
The extinction coefficient of the oligopeptide dimer was taken as
the sum of the two extinction coefficients of the oligopeptide
monomers. The recovery was usually between 25 and 50%. All dimers
were analysed by ESI-MS.
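
The spectrophotometric determination described above follows the
Beer-Lambert relation, concentration = absorbance / (extinction
coefficient x path length). The sketch below is illustrative only.
It assumes a 1 cm path length and reads the stated M-1 values as
M-1 cm-1; the absorbance reading and all names are hypothetical.

```python
def molar_concentration(absorbance: float,
                        extinction_coefficient: float,
                        path_length_cm: float = 1.0) -> float:
    """Beer-Lambert law: c = A / (epsilon * l), returned in mol/l."""
    if extinction_coefficient <= 0 or path_length_cm <= 0:
        raise ValueError("extinction coefficient and path length must be positive")
    return absorbance / (extinction_coefficient * path_length_cm)


def dimer_extinction(epsilon_a: float, epsilon_b: float) -> float:
    """Dimer coefficient taken as the sum of the two monomer coefficients."""
    return epsilon_a + epsilon_b


# Coefficient stated in the text for pyrrole pentamers at 312 nm;
# treating the unit as M-1 cm-1 is an assumption.
EPSILON_PENTAMER_312NM = 46000.0

reading = 0.46  # hypothetical absorbance of a diluted sample
conc = molar_concentration(reading, EPSILON_PENTAMER_312NM)
print(f"pentamer concentration: {conc * 1e6:.1f} micromolar")

# Coefficient assumed for a dimer built from two pentamers.
print(dimer_extinction(EPSILON_PENTAMER_312NM, EPSILON_PENTAMER_312NM))
```

Any dilution made before the measurement would have to be
multiplied back in to obtain the stock concentration.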

Fluorescein-labeling of compounds.

Oligopyrroles with a unique primary amine were obtained by
either cleavage of oligopeptides from solid phase with a diamine
(3,3'-diamino-N-methyldipropylamine) or deprotection of an
N-terminal γ-aminobutyric acid spacer. The N-hydroxysuccinimide
active ester of fluorescein was added in 3 fold excess together
with 6 or more equivalents of DIEA. Reactions were allowed to
proceed at room temperature for 15 minutes and the fluorescein
labeled oligopeptide was purified by HPLC.

Synthesis of P31 and P31T

P31 (Im-β-Im-Py-β-Im-β-Im-β-Dp) was synthesized in a stepwise
fashion by manual solid-phase synthesis from Boc-β-PAM resin as
previously described for imidazole and pyrrole containing hairpin
polyamides (Baird and Dervan, 1996). Since acylation of the
imidazole amine on solid phase gives unsatisfactory results,
Boc-β-alanine couplings were performed by preparing a Boc-β-Im-OH
dimer in solution. The synthesis and activation was as described
for dimers of Boc-γ-aminobutyric acid and imidazole (Baird and
Dervan, 1996). For fluorescent labeling of P31, cleavage from the
solid support was performed with 3,3'-diamino-N-
methyldipropylamine. After HPLC purification, the C-terminal amine
was acylated using a commercially available (Molecular Probes)
N-hydroxysuccinimide active ester of Texas red. The resulting
compound was then again purified by HPLC.

Preparation of probes for DNase I footprinting.

Synthetic oligonucleotides:

GATCTAGACGCATATTAATTGCGCTGTCGACGCATTAGTG

and:

GATCCACTAATGCGTCGACAGCGCAATTAATATGCGTCTA
were hybridized to obtain the W9 probe, oligomerized by ligation
and digested with BamHI and BglII to obtain different tandem
repeats. The following oligonucleotides were prepared identically:

GAF31 is composed of the oligonucleotides:

GATCCTCAGAGAGAGCGCAAGAGCGTCCCGGGAGAAGAGAAGAGAGTA

and

GATCTACTCTCTTCTCTTCTCCCGGGACGCTCTTGCGCTCTCTCTGAG

and Brown1 of oligonucleotides:

GATCCAAGAGAAGAGAAGAGAAGAGAAGAGTACTTATTAACACAACACA

and

GATCTTGTGTTGTGTTAATAAGTACTCTTCTCTTCTCTTCTCTTCTCTTG.

Fragments were purified on low-melt agarose gels and then
cloned into a modified pSP64 vector, cut by BamHI and BglII.
End-labeling was carried out following digestion with HindIII and
a fill-in reaction with Klenow DNA polymerase. The labeled plasmid
was cut with PvuII and the target fragments purified from
low-melting agarose gels. The 557 bp EcoRI/HinfI fragment of the
Drosophila histone SAR was cloned into the SmaI site of the
modified pSP64 plasmid. This SAR probe was end-labeled following
digestion with EcoRI, then cut with ClaI and the resulting 347 bp
fragment purified from low-melting agarose gels.
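
Whether two synthetic oligonucleotides can hybridize as described
can be checked by comparing one strand with the reverse complement
of the other. The following is a minimal sketch for illustration;
the function names are hypothetical, and the W9 sequences are given
as read here from the scanned text, with ambiguous characters
resolved by strand complementarity, which is an assumption.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def anneal_with_5prime_overhangs(top: str, bottom: str, overhang: int = 4) -> bool:
    """True if the strands pair everywhere except for `overhang`
    unpaired bases at the 5' end of each strand."""
    return reverse_complement(top[overhang:]) == bottom[overhang:].upper()


# W9 oligonucleotide pair; reconstructed reading, see the note above.
w9_top = "GATCTAGACGCATATTAATTGCGCTGTCGACGCATTAGTG"
w9_bottom = "GATCCACTAATGCGTCGACAGCGCAATTAATATGCGTCTA"

print(anneal_with_5prime_overhangs(w9_top, w9_bottom))
print(reverse_complement(w9_top[:4]))  # the 5' overhang of each strand
```

The four-base 5' overhang GATC is its own reverse complement, which
is consistent with the oligomerization by ligation described above.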

DNase I footprinting.

All reactions were performed in a total volume of 40 µl. A
polyamide stock solution or buffer (for reference lanes) was added
to an assay buffer containing 20 kcpm radiolabeled DNA, affording
final concentrations of 10 mM Tris-HCl (pH 7.4), 10 mM KCl, 10 mM
MgCl2, 5 mM CaCl2, 0.5 mM EDTA, 0.5 mM EGTA, 1 mM DTT and 0.1%
digitonin. The solutions were allowed to equilibrate for at least
2 h at room temperature. Footprinting reactions were initiated by
the addition of 2 µl of a DNase stock solution (containing ~100 pg
DNase I in buffer) and allowed to proceed for 2 min at room
temperature. The reactions were stopped by addition of 10 µl of a
solution containing 1.25 M NaCl, 100 mM EDTA. Next, 5 µl of a 1%
SDS solution was added, followed by 2 µl of a solution containing
1 µg poly(dA-dT), 1 µg salmon sperm DNA and 10 µg glycogen, and the
DNA was ethanol precipitated (20 min at -20°C). The reactions were
resuspended in 4 µl of 80% formamide loading buffer, denatured 10
min at 85°C, cooled on ice and electrophoresed on 8%
polyacrylamide denaturing gels (5% cross link)
for 1 h. The gels were dried and exposed overnight at -70°C.
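
The concentrations reached after each addition follow from simple
dilution arithmetic. The sketch below is illustrative only; it
assumes that volumes are additive, and the function and variable
names are hypothetical.

```python
def final_concentration(stock_conc: float,
                        added_volume_ul: float,
                        total_volume_ul: float) -> float:
    """Concentration of a component after dilution into the total volume."""
    if total_volume_ul <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * added_volume_ul / total_volume_ul


reaction_ul = 40.0  # footprinting reaction volume
dnase_ul = 2.0      # DNase stock solution
stop_ul = 10.0      # stop solution (1.25 M NaCl, 100 mM EDTA)

# Assumption: volumes are additive.
total_after_stop_ul = reaction_ul + dnase_ul + stop_ul

nacl_final_molar = final_concentration(1.25, stop_ul, total_after_stop_ul)
edta_from_stop_mm = final_concentration(100.0, stop_ul, total_after_stop_ul)

print(f"NaCl after stop: {nacl_final_molar:.2f} M")
print(f"EDTA contributed by stop solution: {edta_from_stop_mm:.1f} mM")
```

The EDTA figure covers only the stop solution and does not include
the 0.5 mM already present in the assay buffer.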

Staining of Drosophila nuclei and polytene chromosomes.
Staxnmg isolated (MirkovitcH et al., 1984). 

Kc Prosophxla nuclex wer ^ CaCl2. 

..Xuted into XBK (.0 ^ 7-'^"^^;_^"',,L with O.S* fresh 
.00 ^ KCX, S - . round coversXip 

parafom^ldehyde for 15 minutes and P ^^^^^^^ 

^, as described previously (Boy de la 
..BBK Por waahin. and etainin., cover.X^^ TfTer centrifuaation 
^^pe of XB. deposited on parafil^B J^ter ^^^^^^^ 
were washed twice (1 minute) stained tor 
/l^ru ™i»u..> .nd .hen »»un«d i„ <= « Hep.a 

a Too »ac., « k=x. x « - - . « 

c"clV'78% glycerol, 1 P.»PH.=yl»e .i™i„e, F.gur. H) 

,«! f-.Bl B was stained with 1 ihixSP «.d 1= ,.M BB. 

instar Xa^ae salivary .land, a^ staled ^^^^ J^^^^^'"''^ 
„ll30pyrrcl« a. follow,. ch.<»«=<»o= -r. ^^^^^^'^^ 
ova^Wa-in. ^0 .1 of XBB tor 1= ™inute= . To avoid dry.«9, a cover 

:ri .;u,d ..a „ad.ed rurd 

x^oBitioned on either side of the squash area. Staxning was 

r idTrtically during 60 n^inutes in 60 Ml XBE using varxous 
out ^ ^ ^.^.^^ ,,d/or DAPI. This solution 

concentrations LexSF, ecniaivm . Qi-idfea 

• ^ -,0 ua/ml of RNase A to avoid RNA signals. Slides 
al^o contained 30 ,sM^ of ^^^^^^ ^.^^ 

were washed twice (7 minutes) in 50 n.1 <.f 

ppnx. The following final dye concentration were used- ^^^^ 
lanel D X.ex9F 16 and EB 30 panel E I.ex9F 1 and EB 30 M"- 

Images were recorded with a wide field, deconvolution-type imaging
system from DeltaVision.
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other method- .^.^.^i,, cl^o.oeon,e aasen^ly were as 

a.sc.i.ed previously ^^^^^ ^ ^^^'J J^^ .3 describe. 
1996). Affinity cleavage experiments were as described
elsewhere (Turner et al., 1997).



,-,1 of .equeoce-specific minor groov. CinSing 
pcly»«id« ..a novel tool. non-g.ni= 

n Ixe ,v».Li»=a. M.hou,. car.e.i». saceXXi.e X 

:r ^e cl .e .o..e,ea «... ^ " 

^, 00 niataiavcin, HoecliBt and DAfi-, 
.inaina drug. " °^;'7 ,,3, >»c^„und 

relatively '''-^ ^'^^^ .„ ,Ho«n to confer high 

apecificity for lo.^ » tra« 

.^reeeiv. tar, "J. to -^^^^^ ^^^^ 

oUgopv-ol- ^.,,„,,^ ele^nt., a pyrrole 

^Tpyrrol. hexa^r <PS. , -t differ only in 
penta»»r <^^' » bo»«S SRR. nearly 

their «..oer length IFxgnre . ,ig„lfic.nt 
t»> order, of magnitude hotter than « (T^le 
S»-.pe=i£icity .a, obtained «.th ^omer.o oU,o^ 
...i. i. expected alnc. t»=y fit e^ally ^11 " ^ ^ 

^ AT-tra=t. o£ SARs. The data .oggest that oligopy 
dirrs ZI Z in an extended hident.te hindin. ^ -he- 
"orhooM are eitner aoco^ated ^ a .i„gle/long or hy «o 
Clustered .T-tr.=ta (hipartit. hilling eite, Fi^re 3eK 
specificity i. tnen due to an energetically favorahle 

•1-v, both hooka in bipartite/ long and a less 
interaction with botn nowjvo f 

flrahle interaction at short/isolated .T-tracts, where only 



o„, i. b<»nd, Th. footprint atudie. .x.H d..e„ -^^== 

. „„.cden.a« binding »odo « w,, ,.nc. at ^^^'^ 
l„=.„cra.ion, the protaction In tha fl«*in9 "g.on («0, 
pXoaed to ,.i.a the ■f.aa. hooH .Figure .c, . stud.e, to 

Teot t.. .indin, ^a tHaaa di»a.a .n — : 
the extended binding ,.ode and de«,natrata that th, flexible 
uL. oan bind bipartite binding aitaa .ep,»ted by aev.«l 
base pairs. 

in^ortantxy, .exl» and xa displayed hi,h .T-.p.ci£i=ity and 
lav, CC.tola.ance. Thia ob.arvation cont.a.ta with that of 

.4... >hree nyirola tlimera linked Kith 
monomer P13 "hioh oonalata of three pyrro 

B-alaninea. P13 «•» " GCtolerant ,inoa Its 

lotrnnt expanded rapidly at i„cr,a3ino ligand conaentration 
rjw, into th. flanKin, ^xed aa^encaa to — Uy protec 
coating, the entire probe (Pi9U.e =B, . Thx. -^'"^ 

. .„ ax-traot of 13 «»■ It ia proposed that about 

thaoretrcally an *T tract o ^ ^^^^ 

minor gr«>va recognition nnit. fit 

relatively favorabr. interaction then force 

r^^inder Of the molecnle along th, „inor groove. ^^^^"^ 

the long flexible spacer of the oligopyrrole dimers may provide
the molecular freedom to avoid continuation in the minor groove.

Several publications previously described the joining of
netropsin and distamycin to dimers with different linkers to
achieve binding to sites of 8 to 10 W (Beerman et al., 1991;
Wang and Lown, 1992). The experiments presented here demonstrate
that the flexible, ethylene oxide-type spacers of the
dimers are highly suited to target continuous or bipartite AT
tracts of 15 to 18 Ws with good specificity.

Synthesizing compounds that bind repeats with high
affinity is chemically more challenging since this sequence
includes a "difficult" motif. However, impressive targeting



co»^=.ea of """^ o>,B.rvati,»>. that the 

Structurally. P31 iatgeted by . »-P-l» 

V^-t^t;' tM» ...isn P.i»clp.l «a, 

,y.c».atically extcnle ^ _^ ^^^^ „^^„ 

.o»e=utiv. ,„Lln. a. .»a -tif». 

sequences that can be targetea. 

^mv hind the DNA minor groove 

.^marallel 2:1 drug to DJIA coi^plexes (Whxte 
a. antxparallel experiments preeented Kere 

However the ..f. J^^^ ^^^^^^ ^^^^ oli.opyrrole d..er 

suggeat a l.l drug ^^^^ ^^^^3^., ^hx3 

— and .3X. --7J;,;^:J_,^ ™ral feature, o. 
binding mode may be favored y ^ ^^rrower than 

Xong AT-tracts,- such ^J^^ . ^ince binding of 

normal .i- J^^^^^ J^,^^ ..e expansion of 

two antiparallel or.ented ^ ^^^^^^^^ 
trie minor groove (Kxellcopf et al-, I j structures 

^-s/^^nv be too costly. LiXewxse, ^^y 
Height energetxcally tend to narrow 

of B-DNA oligomers demonstrated tba P 
tbe minor groove more than OpT steps ~ J^/^;^ 
Hiafavor 2;1 complexes between fi^ 
in turn may aisravot 



repeats . 



Epifluorescent microscopy

Fluorescent DNA dyes with sequence preference, such as DAPI
or Hoechst, are useful, everyday tools of cell biology, medicine
and cytogenetics. Sequence-specific compounds, if successfully
rendered fluorescent, could extend the scientific potential
enormously, since innumerable basic questions about chromosome
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. c-nMld be addresaed using 
^^.,T^ction and dynamics coux« facilitate 
.t.ucture. .oXeculea could tac. 

sequence ^^^-^^^ J^^^ 3uch as chxo^oBome typxng. 

and improve more ro^v 

\ a fluorescent label either at the 

Although conjugation of straightforward, tagging at 

C-terminal end of 3). In general, tagging 

these positions altered -"^^^J^ thereby it^rov.ng 

.educed binding affinity J ^ ^ „ed fro. . 

SAR specificity ^ 3 ^.able 3). Both dyes 

25 and increased from 1^4 ^^^^ ^^^^^^^^ to ariae 

V.i.hlight conspicuous focx .nKcn ^^^^,,,tes X and IH- 

3t.ining of the — ^ ^^^.^ively identified by 
satellite I. an AATAT repeat, V localization of 

reining of apre.d polytene "-J-Tchr^oeo^e . and 3. 

two ma.or .e.S. ^^^^^J^ J^ ,,tellite X. 
coincided with the known location 

1 nf the foci in nuclei ie 

„„cl=opl..m of tR. ^ „ reflect qu^itatxvely 

.he .ind.„9 pxoper..« of ^ ^ ^^^^^ 

i. .4 time, »=r. f^que^ (=--y * 
32768 BP). THUS, Blnce PSF, but not Le!t3(. 

r-.»cL ,i™ll«ly. , stronger nucleopla=»l= signal X3 expected 
for P9F. 
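
The figure of 32768 bp quoted above equals 2^15, which is the spacing expected for a run of 15 consecutive W (A or T) positions in random sequence with equal base frequencies. The sketch below illustrates that arithmetic only. It assumes random sequence with a W probability of 0.5 per position, and the choice of tract lengths 9 and 15 for the comparison is an assumption made for the example; the function name expected_spacing_bp is hypothetical.

```python
def expected_spacing_bp(tract_length: int, p_w: float = 0.5) -> float:
    """Approximate spacing (bp) between W-tracts of a given length.

    In random sequence where each position is A or T with probability
    p_w, a given position starts a run of `tract_length` W bases with
    probability p_w ** tract_length, so such runs are expected about
    once every 1 / p_w ** tract_length bp.
    """
    if tract_length < 1:
        raise ValueError("tract_length must be at least 1")
    if not 0.0 < p_w <= 1.0:
        raise ValueError("p_w must be in (0, 1]")
    return 1.0 / p_w ** tract_length


spacing_w9 = expected_spacing_bp(9)    # shorter tract (assumed length)
spacing_w15 = expected_spacing_bp(15)  # longer tract (assumed length)
fold_difference = spacing_w15 / spacing_w9

print(f"W9 tract expected about every {spacing_w9:.0f} bp")
print(f"W15 tract expected about every {spacing_w15:.0f} bp")
print(f"The shorter tract is {fold_difference:.0f} times more frequent")
```

Real genomes are not random and the Drosophila genome is AT-rich, so these numbers indicate only the relative abundance of short versus long tracts, not absolute site counts.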

The reduced nucleoplasmic signal of Lex9F may not only arise
from a lower abundance of long/clustered AT-tracts but also from a
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.^ucl..r E,oa.t.on.ng (c^ a. AT-rich .ubrcg.on, 

BCfEoldxng the»e =<»pou„a= 

o.5a»i-:i» °' ^" ""t" light o„ ..ni. 

^10? is o»n.is"« ««" ,.„„.t„ctlon of dl££.renti.lly 

,.o«., . Si- ^>-"-^^"'""rM d«I led ».lyoi= w^oh will 
alined nuclei d.™nd. . — 

a«,lt with t= a 

, ^ ^ o,,,erv.d ^» »tri)ci«!. yellow/green 
SMi« oar. "o'" ^Ivtene chromosome.. It 

.„ipe. alo„. the --"-'^JJ^^/ J, P"te.n to the 

"ill ^ " ;:rthi. PU«»-. .e^-c. 

Droeophila genome ' p„ciBely «inc= ourrently 

.„ re.^i«<i to position ^-'"^;J^^^^,,^ p„ci=. fozr this 
av.il.hae cytol=gio.l ».p» ■•"-^ '""^ 



analysis. 



. PBlwere also demonstrated by 

. ^ "^",::r:: poi,tene =h.omo,<-e» .i- - 

staining ieolated Ko nu= Mghlighfd foe. 

Texas red derivative, P31T. ,ig„»l.. These P31T £o=i 

KO nuclei that did not overlap w.th ^^^^^ v 
represent the a*». °^ ^ «rget o£ 

,.^e --;-;--ri:"ri: Polytene chro^.- 
„.T «a, Tghlighted hv this c^ound (Figure 6D, . 

repeat .a. .^^Iv ^ ^ ^ ,„chr»natio arm. or 

„o other PBl -^^/Jf"^;,,,,^ .>.ro.».».ee derived £xo. hwO or 
« the =hro,«.ce„ter of ^""^ satellite se-r^encea 

canton s. flies. The repet.tivenea. °« detection of 

, r.f theae chromosoi»e« facilitate t™ 

^d the polyteny ,i„,^,<^s „ith se^ence-.pecif .c 

the ataining signals. Mheiing 
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, -aes i= e=^erl„«n«llY et«igbt£or««d, allowing 
poly»..de= i^™,r=l.le .cle«i£i= a„a diagnostic 

Chromosome condensation inhibited

vv,^ case of MATH20, LexlO (but not P9) in*^ 
" aexTatio! in Xenopus eg. extracts apecif ically . Tt.e 

chromosome condenaatxon P ^^^^^^^ experiments with 

specificity argument .b toaaed on overcome by 

different oligonucleotides. I.ex oligonucleotides 

^ = QMi-like oligonucleotide but not by oJ-igow 
addition of a SAK like o g ^ ^^^^ 

containing J^^^X either oligonucleotide be .elated 

overcome P9 .nhxbxtion t;.roughout the genotne. As 

the high abundance of - ,,,^ent than 

.enlioned, W9 tracts are ^^^^^^"^^^"'^ convpetitor 

— --^"'nerderto d ^ILe P. f.o. the gen^e. 
oligonucleotide would be needed to P ^„ 
but are higher concentrationa ol.gonucleOt.de 
interfere with chromosome condensation. 

concentration of about 250 n«, .^expected, 

Buggests that innioxta. confirm our previous 

^ onua These obaervatione contirm our t- 

low molecular weight (2.4 kDa vs. 92 kDa) . 



Chromatin opening 



The chromatin studies revealed that titration of AT-tracts
with oligopyrrole P9 massively unfolds the heterochromatic
satellite III. Chromatin opening of satellite III is evidenced
by the massive stimulation of cleavage by endogenous
topoisomerase II when Kc nuclei were exposed to Xenopus egg
extracts. Similar, although less pronounced, observations have
previously been made using distamycin. Unfolding might therefore
arise from a displacement of histone H1 or another protein from
the nucleosomal linker region (Kas and Laemmli, 1992; Kas et
al., 1993). Alternatively, minor groove contacts of the core
histones could be of importance for maintaining the
heterochromatic state of the chromatin fiber. In contrast to P9,
chromatin opening of satellite III required high concentrations
of compound P31. Conversely, P31 but not P9 can open
the heterochromatic GAGAA insert which constitutes the
brown-dominant allele (bwD) (data not shown). These observations
suggest that DNA minor groove binding polyamides may serve as
sequence-specific chromatin openers for silenced genes.

Lex10 did not open chromatin but, in contrast, it efficiently
blocked cleavage by topoisomerase II in a satellite-specific
fashion, since the genome-wide fragmentation mediated by this
activity was not inhibited. Previous studies showed that netropsin
dimers were also more potent, general (not sequence-specific)
inhibitors of this enzyme than monomers (Beerman et al., 1991).
Topoisomerase II cleavage occurs in satellite III in a 10 bp
GC-rich patch that is flanked by very AT-rich (65 to 90%) DNA (Kas
and Laemmli, 1992). Lex10 could possibly sterically block cleavage
by positioning its hooks in the flanking AT-rich regions and
spanning the central GC-rich patch with its long linker.
Topoisomerase II is a prominent target for anticancer drugs;
perhaps a sequence-specific, such as Lex10, rather than general
inhibitor of this activity [...] this respect.

[...] very powerful tools for chromosome research.






References

Adachi, Y., Kas, E., and Laemmli, U. K. (1989). Preferential,
cooperative binding of DNA topoisomerase II to scaffold-associated
regions. Embo J 8, 3997-4006.

Adachi, Y., Luke, M., and Laemmli, U. K. (1991). Chromosome
assembly in vitro: topoisomerase II is required for condensation.
Cell 64, 137-48.

Baird, E. E., and Dervan, P. B. (1996). Solid phase synthesis of
polyamides containing imidazole and pyrrole amino acids. J Am Chem
Soc 118, 6141-6146.

Beerman, T. A., Woynarowski, J. M., Sigmund, R. D., Gawron, L. S.,
Rao, E., and Lown, J. (1991). Netropsin and bis-netropsin
analogs as inhibitors of the catalytic activity of mammalian DNA
topoisomerase II and topoisomerase cleavable complexes. Biochim
Biophys Acta 1090, 52-60.

Biggin, M. D., Bickel, S., Benson, M., Pirrotta, V., and Tjian, R.
(1988). Zeste encodes a sequence-specific transcription factor
that activates the Ultrabithorax promoter in vitro. Cell 53,
713-22.

Bode, J., Kohwi, Y., Dickinson, L., Joh, T., Klehr, D., Mielke,
C., and Kohwi-Shigematsu, T. (1992). Biological significance of
unwinding capability of nuclear matrix-associating DNAs. Science
255, 195-7.

Boy de la Tour, E., and Laemmli, U. K. (1988). The metaphase
scaffold is helically folded: sister chromatids have predominantly
opposite helical handedness. Cell 55, 937-44.








Coll, M., Frederick, C. A., Wang, A. H., and Rich, A. (1987). A
bifurcated hydrogen-bonded conformation in the d(A.T) base pairs
of the DNA dodecamer d(CGCAAATTTGCG) and its complex with
distamycin. Proc Natl Acad Sci USA 84, 8385-9.

Csink, A. K., and Henikoff, S. (1996). Genetic modification of
heterochromatic association and nuclear organization in
Drosophila. Nature 381,

de Clairac, R. P., Seel, C. J., Geierstanger, B. H., Mrksich,
M., Baird, E. E., Dervan, P. B., and Wemmer, D. (1999). NMR
characterization of the aliphatic b/b pairing for recognition of
AT/TA base pairs in the minor groove of DNA. J Am Chem Soc 121,
2956-2964.

Dernburg, A. F., Broman, K. W., Fung, J., Marshall, W. F.,
Philips, J., Agard, D. A., and Sedat, J. W. (1996). Perturbation
of nuclear architecture by long-distance chromosome interactions.
Cell 85, 745-59.

Dimitrov, S., Dasso, M., and Wolffe, A. P. (1994). Remodeling
sperm chromatin in Xenopus laevis egg extracts: the role of core
histone phosphorylation and linker histone B4 in chromatin
assembly. J Cell Biol 126, 591-601.

Forrester, W. C., Fernandez, L. A., and Grosschedl, R. (1999).
Nuclear matrix attachment regions antagonize methylation-dependent
repression of long-range enhancer-promoter interactions. Genes Dev
13, 3003-3014.

Frederickson, R. (1999). "Functional" proteomics? Nat Biotechnol,
1050.






Gasser, S. M., and Laemmli, U. K. (1986). Cohabitation of scaffold
binding regions with upstream/enhancer elements of three
developmentally regulated genes of D. melanogaster. Cell 46,
521-30.

Geierstanger, B. H., Mrksich, M., Dervan, P. B., and Wemmer, D. E.
(1994). Design of a G.C-specific DNA minor groove-binding peptide.
Science 266, 646-50.

Girard, F., Bello, B., Laemmli, U. K., and Gehring, J. (1998).
In vivo analysis of scaffold-associated regions in Drosophila: a
synthetic high-affinity SAR binding protein suppresses position
effect variegation. Embo J 17, 2079-85.

Goodsell, D., and Dickerson, R. E. (1986). Isohelical analysis of
DNA groove-binding drugs. J Med Chem 29, 727-33.

Gottesfeld, J. M., Neely, L., Trauger, J. W., Baird, E. E., and
Dervan, P. B. (1997). Regulation of gene expression by small
molecules. Nature 387, 202-5.

Hart, C. M., and Laemmli, U. K. (1998). Facilitation of chromatin
dynamics by SARs. Curr Opin Genet Dev 8, 519-25.

Henikoff, S. (2000). Heterochromatin function in complex genomes.
Biochim Biophys Acta 1470, 01-08.

Hirano, T., and Mitchison, T. J. (1991). Cell cycle control of
higher-order chromatin assembly around naked DNA in vitro. J Cell
Biol 115, 1479-89.

Hirano, T., Kobayashi, R., and Hirano, M. (1997). Condensins,
chromosome condensation protein complexes containing XCAP-C,
XCAP-E and a Xenopus homolog of the Drosophila Barren protein.
Cell 89, 511-21.

Hsieh, T., and Brutlag, D. (1979). Sequence and sequence variation
within the 1.688 g/cm3 satellite DNA of Drosophila melanogaster. J
Mol Biol 135, 465-81.

Janssen, S., Cuvier, O., Muller, M., and Laemmli, U. K. (2000).
Specific Gain and Loss of Function Phenotypes induced by
Satellite-specific DNA-binding Drugs fed to Drosophila
melanogaster.

Karpen, G. H. (1994). Position-effect variegation and the new
biology of heterochromatin. Curr Opin Genet Dev 4, 281-91.

Kas, E., and Laemmli, U. K. (1992). In vivo topoisomerase II
cleavage of the Drosophila histone and satellite III repeats: DNA
sequence and structural characteristics. Embo J 11, 705-16.

Kas, E., Poljak, L., Adachi, Y., and Laemmli, U. K. (1993). A
model for chromatin opening: stimulation of topoisomerase II and
restriction enzyme cleavage of chromatin by distamycin. Embo J 12,
115-26.

Kielkopf, C. L., Baird, E. E., Dervan, P. B., and Rees, D. C.
(1998). Structural basis for G.C recognition in the DNA minor
groove. Nat Struct Biol 5, 104-9.

Kirillov, A., Kistler, B., Mostoslavsky, R., Cedar, H., Wirth, T.,
and Bergman, Y. (1996). A role for nuclear NF-kappaB in
B-cell-specific demethylation of the Igkappa locus. Nat Genet 13,
435-41.






[...] scaffold-associated regions: cis-acting determinants of
chromatin structural loops and functional domains. Curr Opin Genet
Dev 2, 275-85.

Lohe, A. R., Hilliker, A. J., and Roberts, P. A. (1993). Mapping
simple repeated DNA sequences in heterochromatin of Drosophila
melanogaster. Genetics 134, 1149-74.

Marshall, W. F., Straight, A., Marko, J. F., Swedlow, J.,
Dernburg, A., Belmont, A., Murray, A. W., Agard, D. A., and Sedat,
J. W. (1997). Interphase chromosomes undergo constrained
diffusional motion in living cells. Curr Biol 7, 930-9.

Martello, P. A., Bruzik, J. P., deHaseth, P., Youngquist, R. S.,
and Dervan, P. B. (1989). Specific activation of open complex
formation at an Escherichia coli promoter by
oligo(N-methylpyrrolecarboxamide)s: effects of peptide length and
identification of DNA target sites. Biochemistry 28, 4455-61.

McBryant, S. J., Baird, E. E., Trauger, J. W., Dervan, P. B., and
Gottesfeld, J. M. (1999). Minor groove DNA-protein contacts
upstream of a tRNA gene detected with a synthetic DNA binding
ligand. J Mol Biol 286, 973-81.

Miklos, G. L., and Cotsell, J. N. (1990). Chromosome structure at
interfaces between major chromatin types: alpha- and
beta-heterochromatin. Bioessays 12, 1-6.

Mirkovitch, J., Mirault, M. E., and Laemmli, U. K. (1984).
Organization of the higher-order chromatin loop: specific DNA
attachment sites on nuclear scaffold. Cell 39, 223-32.

[...] and Pommier, Y. (1998). Highly potent synthetic polyamides
[...] immunodeficiency virus type 1 integrase. Mol Pharmacol 54,
280-90.

[...] M., and Clore, G. M. (1997). The solution structure of a
specific GAGA factor-DNA complex reveals a modular binding mode
[see comments]. Nat Struct Biol 4, 122-32.

Pelton, J. G., and Wemmer, D. E. (1989). Structural
characterization of a 2:1 distamycin A.d(CGCAAATTGGC) complex by
two-dimensional NMR. Proc Natl Acad Sci USA 86, 5723-7.

Platero, J. S., Csink, A. K., Quintanilla, A., and Henikoff, S.
(1998). Changes in chromosomal localization of
heterochromatin-binding proteins during the cell cycle in
Drosophila. J Cell Biol 140, 1297-306.

Reeves, R., and Nissen, M. S. (1990). The A.T-DNA-binding domain
of mammalian high mobility group I chromosomal proteins. A novel
peptide motif for recognizing DNA structure. J Biol Chem 265,
8573-82.

Rykowski, M. C., Parmelee, S. J., Agard, D. A., and Sedat, J. W.
(1988). Precise determination of the molecular limits of a
polytene chromosome band: regulatory sequences for the Notch gene
are in the interband. Cell 54, 461-72.

Saitoh, Y., and Laemmli, U. K. (1994). Metaphase chromosome
structure: bands arise from a differential folding path of the
highly AT-rich scaffold. Cell 76, 609-22.

Spierer, A., and Spierer, P. (1984). Similar level of polyteny in
bands and interbands of Drosophila giant chromosomes. Nature 307,
176-8.

Strick, R., and Laemmli, U. K. (1995). SARs are cis DNA elements
of chromosome dynamics: synthesis of a SAR repressor protein. Cell
83, 1137-48.

Taylor, J. S., Schultz, P. G., and Dervan, P. B. (1984). Sequence
specific cleavage of DNA by distamycin-EDTA • Fe(II) and
EDTA-distamycin • Fe(II). Tetrahedron 40, 457-465.

Turner, J. M., Baird, E. E., and Dervan, P. B. (1997). Recognition
of seven base pair sequences in the minor groove of DNA by
ten-ring pyrrole-imidazole polyamide hairpins. J Am Chem Soc 119,
7636-7644.

Turner, J. M., Swalley, S., Baird, E. E., and Dervan, P. B.
(1998). Aliphatic/aromatic amino acid pairings for polyamide
recognition in the minor groove of DNA. J Am Chem Soc 120,
6219-6226.

Wang, W., and Lown, J. W. (1992). Anti-HIV-I activity of linked
lexitropsins. J Med Chem 35, 2890-7.

White, S., Baird, E. E., and Dervan, P. B. (1997). On the pairing
rules for recognition in the minor groove of DNA by
pyrrole-imidazole polyamides. Chem Biol 4, 569-73.

Yanagi, K., Privé, G. G., and Dickerson, R. E. (1991). Analysis of
local helix geometry in three B-DNA decamers and eight dodecamers.
J Mol Biol 217, 201-214.

Youngquist, R. S., and Dervan, P. B. (1985). Sequence-specific
recognition of B-DNA by oligo(N-methylpyrrolecarboxamide)s. Proc
Natl Acad Sci USA 82, 2565-9.

Youngquist, R. S., and Dervan, P. B. (1987). A synthetic peptide
binds 16 base pairs of A,T double helical DNA. J Am Chem Soc 109,
7564-7566.



